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Coronavirus: keep 
sharing research 


Researchers must ensure that their work on 
this outbreak is shared rapidly and openly. 


wenty thousand cases; more than 400 lives lost. 

The coronavirus first reported last December is 

nowa public-health emergency of international 

concern. In China, cities have been sealed off, 

and the authorities have built an entire new 
hospital in Wuhan, where the outbreak started. 

Along with medical workers, the country’s researchers 
are playing a vital part. Epidemiologists are working to 
update estimates of case numbers; genome samples of 
the pathogen are being sequenced and results are being 
shared. 

In two papers in Nature, teams led by researchers at 
the Wuhan Institute of Virology and at Fudan University, 
Shanghai, confirm that the virus is similar to the one that 
caused severe acute respiratory syndrome (SARS), and 
that there’s evidence it originated in bats. The Wuhan team 
analysed viral-genome samples from a small number of 
patients, all of whom worked at the animal market from 
which the first cases were reported (P. Zhou et al. Nature 
http://doi.org/ggj5cg; 2020). The Fudan team sequenced a 
sample from one infected market worker (F. Wu etal. Nature 
http://doi.org/dk2w; 2020). 

In the first days after the outbreak became known, we 
confirmed that reporting research and data will in no way 
affect consideration of submissions to Nature. Nature and 
its publisher Springer Nature have nowsigned a joint state- 
ment with other publishers, funders and scientific societies 
to ensure the rapid sharing of research data and findings 
relevant to the coronavirus. In the statement, we commit 
to working together to help ensure that: 

- All peer-reviewed research publications relevant to the 
outbreak are made immediately open access, or freely avail- 
able at least for the duration of the outbreak. 

Research findings relevant to the outbreak are shared 
immediately with the World Health Organization (WHO) 
upon journal submission, by the journal and with author 
knowledge. 

Research findings are made available via preprint serv- 
ers before journal publication, or via platforms that make 
papers openly accessible before peer review, with clear 
statements regarding the availability of underlying data. 

* Researchers share interim and final research data 
relating to the outbreak, together with protocols and 
standards used to collect the data, as rapidly and widely 
as possible — including with public health and research 
communities and the WHO. 

The priority now is to stop the virus’s spread and 
help those affected. That includes understanding how 
the virus is transmitted between people, ramping up 
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supplies of diagnostic equipment and accelerating vaccine 
development. Beyond this, questions are being asked about 
whether there were delays in sounding the alarm. Answer- 
ing this honestly is necessary if we are to learn lessons for 
next time. It’s also essential to improve regulation of animal 
markets, because lax oversight increases the risk of new 
viruses transferring from animals to humans. And funds 
must be released for better disease surveillance in the poor- 
est countries — the main reason the WHO declared the virus 
a public-health emergency of international concern. 

For researchers, the message is simple: work hard to 
understand and combat this infectious disease; make that 
work of the highest standard; and make results quickly 
available to the world. 


Cancer genomics 
gets new focus 


To realize the full potential of cancer genomics 
studies, tumour sequence data needs to be 
paired with clinical background information. 


his week, Nature is publishing a suite of papers 
that sheds new light on the genetic causes of 
cancer. The results show how far our under- 
standing of cancer has come — and how far we 
still have to go. 

The Pan-Cancer Analysis of Whole Genomes Consortium 
brought together researchers with nearly 750 affiliations 
across 4 continents. Between them, they sequenced full 
genomes from more than 2,600 samples representing 
38 different types of cancer. The work is summarized ina 
News & Views article on page 39. 

The project is remarkable in both scope and complexity, 
and, partly because of this, faced challenges at every step; 
from acquiring samples to protecting patient privacy while 
putting terabytes of data into the hands of researchers. 

Thanks to these efforts — and previous full-genome 
sequences — scientists now have an unprecedented view 
of the genetic changes that can contribute to cancer, anda 
clearer idea of where gaps in knowledge remain. Altogether, 
the team pinpointed 705 mutations that occurred repeat- 
edly inthe cancer genomes, suggesting that they are impor- 
tant for tumour growth. Of these, about 100 fell outside 
the protein-coding regions of the genome, but more such 
mutations might be uncovered with improvements in com- 
putational techniques for analysing non-coding regions. 
Overall, the authors found that cancer genomes contain an 
average of four to five mutations that drive tumour growth. 
In 5% of cases, however, they found no such mutations. 

Cancer genomes have been sequenced for more than a 
decade, but nowresearchers and the funders who support 
them must tackle the next challenge. The goal has always 
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been to improve the lives of those affected by cancer, and 
the reams of data amassed by sequencing projects have 
helped. They are used by researchers to find new drug 
targets, and to generate new markers that can be used to 
match patients with the treatment most likely to help. 

But most of the data so far have been limited in one 
crucial respect: clinical details of the sample donors are 
often missing. The first samples collected for the Cancer 
GenomeaAtlas, asequencing project that ran from 2006 to 
2018, co-funded by the US National Cancer Institute and the 
National Human Genome Research Institute, typically came 
with little more than the donor’s gender, diagnosis and age 
at diagnosis. Rarely would there be arecord of that person’s 
family or medical history, what therapy they had received 
and how they had responded — all crucial information if 
genome sequences are to be put to work to help patients. 

The next generation of cancer-genome sequencing pro- 
jects is trying to change that. But gathering detailed clinical 
information is more difficult — and more expensive — than 
sequencing genomes, particularly in the many countries 
that lack a unified health-care system. There, accessing 
hospital records is complicated: different hospitals keep 
records differently; patients often move from one treat- 
ment centre to another; and the quality of records varies 
enormously. More-detailed records also mean greater risk 
of personal exposure if there is a privacy violation, raising 
the bar yet again for participant protection. 

These are all pressing issues, not only in cancer research, 
but in health care generally. Efforts are already under way 
totransform health records into a format that can be more 
readily, but securely, accessed and studied. The Ameri- 
can Association for Cancer Research’s project GENIE, for 
example, has compiled 70,000 records of tumour DNA 
sequences, and real-world clinical data. The United King- 
dom’s 100,000 Genomes Project also aims to match DNA 
sequences with clinical information for a variety of condi- 
tions. And the International Cancer Genome Consortium, 
which has coordinated much of the tumour sequencing 
work so far, has launched a new phase, this time with a focus 
onclinical information. 

Pooling large numbers of samples is a powerful way to 
find genetic changes that can drive cancer, and provides 
a starting point for learning how they do so. But the real 
return on investment will come when that information 
can be used to tailor therapy to individual patients. And 
for that to be achieved, clinical background information 
on study participants is essential. 

When cancer-genome sequencing projects were first 
launched, it was hoped that they would provide a catalogue 
of mutations that could give rise to cancer — and reveal 
broad patterns on which researchers could base drug devel- 
opment. The core of that mission has been achieved, but 
many cancers have proved more complex than expected. 
Seemingly similar cancers can contain very different sets 
of mutations — no two cancers are quite the same. 

Asis often the casein biomedical research, the answers to 
a question are more complex than originally imagined. But 
recognizing the complexity is empowering, and harness- 
ing it will be necessary in the search for better treatments. 
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Read all about it 


Nature will trial the publication 
of peer-review reports. 


esearch communities are unanimous in 

acknowledging the value of peer review, but 

there’s a growing desire for more transparency 

inthe process. As part of that, researchers want 

to see how publishing decisions are made, and 
they want greater assurance that referees and editors act 
with integrity and without bias. 

For many journals, including Nature, peer review has 
typically been single-blind — that is, authors do not know 
who is reviewing their paper. At the same time, the con- 
tents of peer-review reports, and correspondence between 
authors, reviewers and editors, are kept confidential. 

This prevents readers from seeing the often fascinating 
and important discussions between authors and reviewers, 
which are crucial in shaping and improving research and 
checking its integrity. Keeping these debates confidential 
also helps to reinforce perceptions that the research paper 
is the last word on a subject — when the latest finding is 
often simply a milestone along the scholarly journey. 

Our authors have told us they want change. In a 2017 
survey of Nature referees, 63% of respondents said pub- 
lishers should experiment with alternative models, and 
more than half said peer review could be more transparent. 

Four years ago, Nature invited referees to be acknowl- 
edged in papers — with the consent of both author and 
reviewer. Around 3,700 Nature referees have chosen to 
be publicly recognized, and around 80% of the journal’s 
papers have at least one referee named. 

Beginning this week, authors of new submissions to 
Nature will be offered the option to have anonymous 
referee reports published, along with their own responses 
and rebuttals, once a manuscript is ready for publication. 

Those who agree to act as reviewers should know that 
their anonymous reports — and their anonymized corre- 
spondence with authors — might be published. Referees 
can also choose to be named, should they desire. 

In making this change, Nature is following seven other 
Nature Research journals. And we're joining the pioneer- 
ing efforts of The EMBO Journal and BMC journals — and, 
more recently, Nature Communications, which has been 
publishing reviewer reports since 2016. 

We will report back as the trial progresses, but the expe- 
rience of Nature Communications has been positive. In 
2018, the overwhelming majority (98%) of the journal’s 
authors who had published their reviewer reports told us 
they would do so again. 

Published peer reviews are intended to advance schol- 
arly discussion about a piece of research and it is important 
that our readers and the research community at large can 
benefit from such discourse. We are pleased to be playing 
asmall part in making that happen. 
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A personal take on science and society 


World view 


By Gail Cardew 


People will not trust 
unkind science 


Amean and aggressive research working 
culture threatens the public’s respect for 
scientists and their expertise, says Gail Cardew. 


arlier this month, a survey from Wellcome 

in London confirmed that unkindness, and 

worse, is pervasive in science (see go.nature. 

com/2v4fn3w). Academic leaders expressed 

alarm — both for the health of young research- 
ers and for how such pressure could erode the quality of 
science. I think there is more to worry about. 

What hopeis there for those in science to build atrusting 
and respectful relationship with the public when so many 
scientists are schooled ina culture lacking these qualities? 

The need for trust and respect is particularly acute 
now, when people, as the British politician Michael Gove 
infamously put it, “have had enough of experts”. Similar 
arguments have come from around the world. 

According to a 2019 report by public-opinion research 
firm Ipsos Mori, the way people behave, especially their 
ability to think of others’ interests, influences their 
trustworthiness. Competence is not enough (go.nature. 
com/37lydga). This is backed up by asurvey of people living 
onpotentially contaminated land, which found that citizens 
whosaid they did not trust the underlying science were not 
questioning scientists’ expertise, but whether scientists 
shared the public’s interest (go.nature.com/2giuvyb). 

Unkindness in science saddens me for many reasons. 
Obviously, I feel for the devoted researchers who began 
their careers expecting to revel in the joy of discovery, only 
to find their love of the subject squeezed out, replaced by 
fear and anxiety. It also saddens me because I’ve witnessed 
some of this toxic culture spill out of the laboratory, into 
scientists’ dealings with the public. 

Ihave spent decades examining the relationship between 
science, culture and society, most recently as director of 
science and education at the Royal Institution of Great 
Britain in London, heading ateam that connected leading 
scientists with the public — in person, online, on television 
and in the classroom. I have long believed that scientists 
have a duty to discuss their work and its implications. 

Conducting research responsibly includes engaging with 
the public. More and more researchers are now making that 
effort: speaking at science festivals, giving public talks and 
visiting schools. They often describe notjust their research, 
but how amazing it is to be a scientist, with the opportu- 
nity to think about the many unanswered questions facing 
humanity. 

Most speakers take an interest in their audience and give 
thoughtful, sensitive answers to audience questions. But 
some become confrontational at any remark interpreted 
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as questioning their expertise. Some dismiss questions 
they deem irrelevant or stupid. Some take umbrage if 
there are no questions, sometimes mistaking diffidence 
fora lack of interest. Those who do engage, but unkindly, 
can make matters worse. In one toe-curlingly awkward 
case in Europe, aspeaker berated women in the audience 
for not asking questions. I'd bet that that audience subse- 
quently felt less consideration both for science and for the 
importance of policies informed by it. 

Irecall howa child asked Ellen Stofan, then NASA’s chief 
scientist, how useful Lego blocks would be to get to Mars. 
Stofan’s warm, inspiring answer: “Everything we do at 
NASA, someone has to imagine first,” she said. “You have 
to learn to be creative, to be innovative, and that’s why 
the arts are animportant part of education.’ I’m sure that 
response left her listeners with a higher esteem for science. 
And Fields Medal winner Cédric Villani once told me this of 
public engagement: “It reinvigorates you. It also helps you 
to understand what you are doing and why you are doingit.” 

Wellcome’s survey found that nearly four-fifths of 
researchers think competition has created unkind and 
aggressive conditions. Most (61%) have witnessed bullying 
or harassment, and 43% have experienced it themselves. 
Only 37% feel comfortable speaking up. 

InaNature poll following up onthe survey, large majori- 
ties said that institutions, funders and lab heads should be 
the ones responsible for changing the culture (go.nature. 
com/36j4yar). If we want to build trust in science and scien- 
tists, itis not enough to think about ‘what’ we achieve; we 
must think about ‘how we influence those around us. That’s 
why, when I left the Royal Institution, the farewell message 
that meant the most commended me for having achieved 
amazing things in a way that was “kind and humane’. 

Ahumane environment comes about through decisions, 
notluck. Make time for regular reflection on how you could 
have handled situations better, and have the courage to 
admit that swiftly to those concerned. Ask people for 
guidance, especially those supposedly less experienced, 
and definitely those less powerful, because they can often 
provide afresh perspective. Give people time to learn and 
grow; recognize when they need help and also when they 
need to be left alone to make their own way, including mis- 
takes. Above all, everyone should feel able to bring their 
whole selves to work, where differences in lives and back- 
grounds are celebrated, where unique perspectives and 
contributions are valued and not interpreted as criticism. 

A kinder research culture will build stronger, deeper 
support for research, as well as higher-quality science. 
Maintaining public trust should not mean shouting more 
loudly ina noisy world. Instead, let’s look at our own behav- 
iour and ask ourselves — are we really acting in the best 
interests of others? 
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The world this week 


News in brief 


CORONAVIRUS PAPERS APPEARING RAPIDLY 
AS RESEARCHERS RESPOND TO OUTBREAK 


More than 70 research studies 
onthe new coronavirus have 
been released over the past 
few weeks, as scientists rush to 
understand the pathogen and 
how it spreads. 

The virus, known as 
2019-nCoV, causes a serious 
respiratory illness and has 
so far infected more than 
20,000 people and killed at 
least 400, according to reports 
as Nature went to press. It has 
also spread to multiple other 
countries. The infection is 
thought to have originated ina 
food market in the Chinese city 
of Wuhan, which has been on 
lockdown — with travel into and 
out of the city restricted — since 
23 January. 

The escalating outbreak has 
prompted a flurry of research 
activity on the coronavirus, 
which emerged in humans 
last December and is new to 
science. Nature searched for 
studies about the virus using 


the terms ‘coronavirus’ or ‘ncov’ 


onthe preprint servers bioRxiv, 
medRxiv and ChemRxiv, as 
well as on Google Scholar, the 
discussion forum virological. 
org, scholarly-activity tracker 


Altmetric and the websites of 
institutions that had published 
preliminary research reports on 
the subject. As of 4 February, at 
least 77 English-language papers 
onthe coronavirus have been 
published (see ‘Coronavirus 
research’). 

More than half of the studies 
are on preprint servers, anda 
handful have appeared in peer- 
reviewed journals, including 
The Lancet and the Journal of 
Medical Virology. The search did 
not include Chinese-language 
journals. 

Several of the papers contain 
estimates of how rapidly the 
virus spreads, or the length of its 
incubation period — how long 
after being infected with the 
virus people start to experience 
symptoms. 

Other studies focus on the 
virus’s structure or genetic 
make-up — information that 
could be used to identify drug 
targets or develop a vaccine. 
Researchers have also published 
genomic data onthe virus 
ononline platforms such as 
GISAID or GenBank, but Nature’s 
analysis did not count these data 
uploads. 


CORONAVIRUS RESEARCH 


Dozens of studies about the virus have been published since the outbreak began. 


Cumulative studies 
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HARVARD CHEMISTRY 
CHIEF'S ARREST 
STUNS SCIENTISTS 


Researchers have reacted 

with shock to the arrest of top 
nanoscientist Charles Lieber, 
who has been charged with lying 
to the US government about 
receiving funding from China. 

Lieber, who leads the 
chemistry department 
at Harvard University in 
Cambridge, Massachusetts, 
was arrested on 28 January and 
released on bail two days later. 

Colleagues of Lieber said 
they were shocked by his 
detainment. “I have 100% 
trust and confidence in him. 
Ithink there must be some 
misunderstanding,” says 
Xiaocheng Jiang, a former 
student of Lieber who is now 
at Tufts University in Medford, 
Massachusetts. 

The federal charges focus on 
Lieber’s alleged involvement 
in China’s Thousand Talents 
Plan, a programme designed 
to recruit academics to 
the country. They come as 
US authorities increasingly 
scrutinize universities’ foreign 
ties, amid fears that countries 
might be stealing or influencing 
US research. The FBl alleges 
that Lieber received hundreds 
of thousands of dollars froma 
Chinese university and agreed 
to lead alab there — but that 
he denied his involvement 
when asked by US government 
agencies. Lieber’s legal team did 
not respond to Nature’s requests 
for comment. 
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The highest- 
resolution 
image of 
the Sun ever 
taken 
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The world’s most powerful solar telescope has opened 
its eyes. The US$344-million Daniel K. Inouye Solar 
Telescope, which has been two decades in the making, 
is scrutinizing the Sun in extraordinary detail from atop 
Haleakala mountain in Hawaii. 

Images released on 29 January show patterns of 
superheated gas churning on the Sun’s surface. Bright 
‘cells’ represent the plasma rising from deeper in the 
star, and the darker borders between the cells indicate 
where plasmais cooling and sinking. 

The 4-metre Inouye telescope eclipses what had been 
the world’s largest solar telescope, a1.6-metre facility 
at Big Bear Solar Observatory in southern California. 
Scientists say that the dramatic upgrade will transform 
solar physics for decades. The Inouye Solar Telescope 
will make the most precise measurements of the Sun’s 
magnetic field so far, including the first-ever magnetic 
measurements in the Sun’s atmosphere, or corona. 

‘It’s going to be sucha revolution,’ says Momchil 
Molnar, a solar physicist at the University of Colorado 
Boulder. 


PROMISINGHIV 
VACCINE FAILSIN 
LARGE TRIAL 


The quest to develop a vaccine 
against HIV has been dealt a 
setback. Researchers running a 
trial of aonce-promising vaccine 
in South Africa have stopped 
administering immunizations 
after an analysis showed that 
the vaccine was not effective. 
The study’s sponsor, the US 
National Institute of Allergy and 
Infectious Diseases, announced 
the trial’s cancellation on 

3 February. 

The trial, called HVTN 702, 
enrolled 5,407 people who 
did not have HIV, and they 
received either the vaccine ora 
placebo injection. The vaccine 
that participants received 
was similar to one that, ina 
previous trial in Thailand, had 
reduced infections by about 
30% compared with the trial’s 
placebo group. That marked 
the first-ever success for an HIV 
vaccine ina large trial, albeit a 
modest one. 

But an independent board 
that was monitoring interim 
data from the South Africa trial 
determined that, after most of 
the volunteers had been in the 
study for 18 months or more, 
the vaccine was not protecting 
participants from HIV infection. 
Among the 2,694 people who 
received the immunization, 129 
contracted HIV; 123 of the 2,689 
participants who received the 
placebo tested positive for HIV. 
Researchers will continue to 
follow the volunteers and try 
to determine why the vaccine 
failed. 


NDIABETSBIG 
ONQUANTUM 
TECHNOLOGIES 


Quantum technology has 

been given a massive boost 

in India’s latest budget, 
receiving 80 billion rupees 
(US$1.12 billion) over 5 years as 
part of anew national quantum 
mission. 

The move places India 
alongside the United States and 
Europe, which in the past few 
years have each pledged more 
than US$1 billion to research in 
the field. Russia also announced 
an initiative worth hundreds of 
millions of dollars late last year. 

India’s investment — 
announced on 1 February 
by finance minister Nirmala 
Sitharaman (pictured) — willbe 
administered by the ministry 
of science and technology, and 
is aconsiderable increase on 
past commitments. A national 
quantum-technology research 
programme announced in 2018 
received US$27.9 million over 
S years. 

Ashutosh Sharma, secretary 
of the ministry’s department 
of science and technology, 
says India’s quantum research 
is solid on the theoretical 
side, but needs infrastructure 
and experimental facilities. 
The mission will develop 
quantum technologies for 
communications, computing, 
materials development 
and cryptography, and 
will coordinate the work of 
scientists, industry leaders and 
government departments, he 
says. 
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The world this week 


News in focus 


Workers disinfect areas of Qingdao, China, as part of virus-control measures. 


WHAT'S NEXT FOR THE 
CORONAVIRUS OUTBREAK 
CAUSING GLOBAL ALARM 


Many aspects of the new virus remain unknown, but scientists 
are weighing up the best- and worst-case scenarios for the future. 


By Dyani Lewis 


shealth authorities around the world 
race to halt the spread of a new virus 
that emerged in the Chinese city of 
Wuhan in December, scientists are 
considering what is likely to happen 
next, and how bad the outbreak might get. As 
Nature went to press, more than 20,000 peo- 
ple had contracted the coronavirus, which 
causes respiratory illness; the death toll was 
426, and rising daily. Crucial details about the 
virus and howit spreads are still unknown, but 
scientists are considering best- and worst-case 
scenarios on the basis of previous epidemics 
and what they already know about this one. 


How many people will be infected? 

Chinese authorities have locked down cities 
at the centre of the epidemic, and research- 
ers were quick to share data on the virus with 
the World Health Organization (WHO) and 
other scientists. But the case numbers have 
been rising. According to one prediction, 
the virus could infect about 46,000 of the 30 
million people living in and around Wuhan 
(see go.nature.com/31m9fzz). In the best 
case, fewer people will be infected because 
the effects of the control measures will start 
kicking in, says Ben Cowling, an epidemiolo- 
gist at the University of Hong Kong. Butit’s too 
early to tell whether efforts to quarantine peo- 
ple, and the widespread use of face masks, are 
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working. The incubation period for the virus 
— up to 14 days — is longer than most control 
measures have been in place, he says. 

Ina worst-case scenario, some 190,000 peo- 
ple could be infected in Wuhan, according to 
another model (J. M. Read et al. Preprint at 
medRxiv http://doi.org/dkzb; 2020). Scien- 
tists are particularly concerned about fresh 
outbreaks emerging outside China. The 
virus has already spread in small, localized 
clusters in Vietnam, Japan, Germany and 
the United States, but authorities have been 
quick toisolate the people affected. More than 
150 cases had been recorded outside China 
as of 3 February, including one death in the 
Philippines. 
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News in focus 


Is the virus here to stay? 

When a virus circulates continuously inacom- 
munity, it is said to be endemic. The viruses 
that cause chicken pox and influenza are 
endemic in many countries, for example. If 
efforts to contain the coronavirus fail, and if 
the virus can be passed on by people who are 
infected but don’t have symptoms, it will be 
more difficult to control its spread, making it 
more likely that it will become endemic, too. 

As with influenza, this could mean that 
deaths occur every year as the virus circulates, 
until a vaccine is developed. But if control 
measures are effective, and transmission slows 
so that each infected person infects no more 
than one other person, the current outbreak 
could simply peter out, says Cowling. 

There have been several cases of infected 
people displaying no symptoms, but it’s still 
unclear whether such asymptomatic or mild 
cases are common, and whether or to what 
extent they are infectious. Asymptomatic 
cases set the new virus apart from the related 
coronavirus that causes severe acute respira- 
tory syndrome (SARS). There was a global 
outbreak of the SARS virus in 2002-03, but 
it usually spread only once people were ill 
enough to need hospital care, and was even- 
tually contained. There is no evidence that 
the virus is still circulating in humans, says 
lan Mackay, a virologist at the University of 
Queensland in Brisbane, Australia. 


Is the new virus likely to change? 


Currently, it has caused severe illness, and 
death, mainly in older people, particularly 
those with pre-existing health conditions. 
Some researchers are worried that the 
pathogen could mutate so it can spread more 
efficiently, or become more likely to cause 
disease in young people. 

But Kristian Andersen, aninfectious-disease 
researcher at Scripps Research in La Jolla, 
California, says this is unlikely. Viruses con- 
stantly mutate, he says, but those mutations 
don’t typically make the virus more virulent or 
cause more serious disease: most mutations 
are detrimental to the virus or have no effect. 
A 2018 study of the SARS virus in primate cells 
found that a mutation it sustained during the 
2003 outbreak probably reduced its virulence 
(D. Muth etal. Sci. Rep. 8, 15177; 2018). 

Researchers have shared dozens of genetic 
sequences from strains of the new corona- 
virus, and asteady supply of those sequences 
will reveal genetic changes as the outbreak 
progresses, says Mackay. 


How many people will it kill? 


The fatality rate for a virus — the proportion 
of infected people who die — is difficult to cal- 
culate in the middle of an outbreak because 
records are constantly being updated. With 
about 400 deaths so far out of more than 
20,000 infections, the new coronavirus has a 
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death rate of 2-3%. This is significantly lower 
than SARS, which killed around 10% of infected 
people. And the known death rate for the new 
coronavirus is likely to decrease as mild and 
asymptomatic cases are identified, says virolo- 
gist Mark Harris at the University of Leeds, UK. 

The death toll will also depend on how 
China’s health systems cope with the number 
of cases. Putting people on drips and ventila- 
tors canensure that they get enough fluids and 
oxygen while their immune systems fight the 
virus. If the virus spreads to regions with few 
resources, suchas parts of Africa, health sys- 
tems could struggle, says Sanjaya Senanayake, 
an infectious-disease specialist at the Austral- 
ian National University in Canberra. 

On 30 January, the WHO declared the out- 
break a “public-health emergency of inter- 
national concern”. WHO director-general 
Tedros Adhanom Ghebreyesus said his main 


concern was that the outbreak could spread 
to countries with fragile health systems. 

There are currently no effective drugs 
against the virus, but two HIV drugs are being 
tested as a treatment, and several research 
groups are working ona vaccine. 

The current death rate of 2-3% — although 
notas high as that of SARS — is still high for an 
infectious disease, says Adam Kamradt-Scott, 
a global health-security specialist at the 
University of Sydney, Australia. The 1918 
influenza outbreak infected around half a 
billion people, one-third of the world’s pop- 
ulation at the time, and killed more than 2.5% 
of those infected; some have estimated that 
as many as 50 million people died. The new 
coronavirus probably won't trigger such an 
apocalyptic scenario, because it isn’t typically 
infecting or killing young, healthy people, says 
Kamradt-Scott. 


CORONAVIRUS: LABS 
WORLDWIDE SCRAMBLE 
TO ANALYSE SAMPLES 


Scientists need the pathogen to 


develop tests, drugs and vaccines. 


By Ewen Callaway 


ith no sign that an outbreak of a 
new coronavirus is abating, virol- 
ogists worldwide are itching to 
get physical samples of the virus. 
They are planning to test drugs 
and vaccines, develop animal models of the 
infection and investigate the virus’s biology. 

“The moment we heard about this outbreak, 
we started to put our feelers out to get access 
totheseisolates,” says Vincent Munster, a virol- 
ogist at the US National Institute of Allergy and 
Infectious Diseases centre in Hamilton, Mon- 
tana. His lab is expecting to receive a sample 
soon fromthe US Centers for Disease Control 
and Prevention in Atlanta, Georgia. 

The first lab to isolate and study the virus, 
known as 2019-nCoV, was at the Wuhan 
Institute of Virology — in the city where the 
outbreak started. A team led by virologist 
Zheng-Li Shi isolated the virus from a woman 
who developed symptoms in December. 
Shi’s team found that the virus can kill cul- 
tured human cells and that it enters them 
through the same molecular receptor as the 
coronavirus that causes SARS (severe acute 
respiratory syndrome; P. Zhou et al. Preprint 
at bioRxiv http://doi.org/ggjs7d; 2020). 

An Australian lab said on 28 January that it 
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had obtained virus samples from an infected 
person who had returned from China, and it 
was preparing to share them with other sci- 
entists. Labs in France, Germany and Hong 
Kong are also isolating and preparing to 
share virus samples taken from local patients, 
says Bart Haagmans, a virologist at Erasmus 
Medical Center in Rotterdam, the Netherlands. 
Haagmans hopes to receive viral material from 
one of these labs in the coming days. 

Several labs have sequenced the virus, but 
scientists say that the results are no substi- 
tute for live samples, which are needed to test 
drugs and vaccines, and to study the virus in 
depth. Munster says that his priority will be 
to identify animals that experience the infec- 
tionina similar way to humans. These will be 
useful for developing vaccines and drugs. The 
team first plans to look at a mouse genetically 
engineered to contain a human version of the 
receptor that the coronavirus uses to infect 
cells. Future work could involve exposing mice 
and non-human primates to the virus and test- 
ing whether vaccines can prevent infection. 

Munster’s lab is also eager to start gauging 
how long the virus can survive in the air or in 
saliva droplets. This could help epidemiolo- 
gists to understand whether the virus can be 
transmitted through the air, or only through 
close contact. 


OMER MESSINGER/EPA-EFE/SHUTTERSTOCK 


SOCIAL SCIENTISTS 
BATTLE BOTS TO GLEAN 
INSIGHTS ONLINE 


Automated production of social-media 
posts can confound research studies. 


By Heidi Ledford 


ocial-media bots that pump out 
computer-generated content have 
been accused of swaying elections 
and damaging public health by spread- 
ing misinformation. Now, some social 
scientists have a fresh accusation: bots med- 
dle with research studies that mine popular 
sites such as Twitter, Reddit and Instagram for 
information on human health and behaviour. 
Data from these sites can help scientists 
to understand how natural disasters affect 
mental health, why young people have flocked 
to e-cigarettes in the United States and how 
people join together in complex social net- 
works. But such work relies on distinguishing 
the real voices from the automated ones. 
“Bots are designed to behave online like peo- 
ple,” says Jon-Patrick Allem, a social scientist 
at the University of Southern California in 
Los Angeles. “If a researcher is interested in 
describing public attitudes, you have to be 
sure that the data you're collecting on social 
media is actually from people.” 
Computer scientist Sune Lehmann designed 
his first bots in 2013, as a social-network exper- 
iment for a class that he was teaching at the 


Technical University of Denmark in Kongens 
Lyngby. Back then, he says, Twitter bots were 
simple, obscure and mainly meant to increase 
the number of followers for specific accounts. 
Lehmann wanted to show his students how 
such bots could manipulate social systems, 
so together they designed simple bots that 
impersonated fans of the singer Justin Bieber. 

The ‘Bieber Bots’ quickly attracted thou- 
sands of followers. But social-media bots 
have continued to evolve, becoming more 
complex and harder to detect. They surged 
into the spotlight after the 2016 US presiden- 
tial election — amid accusations that bots had 
been deployed on social media in an attempt 
to sway the vote. “All of a sudden, it became 
something of interest to people,” Allem says. 

Since then, Allem has shown that tweets 
generated by bots are twice as likely as their 
real counterparts to attest that e-cigarettes 
help people to give up smoking". Bots are 
also more likely to tout the unproven health 
benefits of cannabis”. These studies rely on 
algorithms that estimate the likelihood that 
a Twitter account is automated. But despite 
bot-detecting tools with names like BotSlayer, 
Allem says that many social-science and pub- 
lic-health researchers still fail to filter out 


Separating real online voices from automated ones can be a problem for researchers. 
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probable automated content from their data. 

That omission can pollute a data set, says 
Amelia Jamison, who studies health dispari- 
ties at the University of Maryland in College 
Park and has mined social media for posts that 
oppose vaccination. “You might be artificially 
giving the bots a voice by treating them as if 
they are really part of the discussion, when they 
are actually just amplifying something that 
may not be voiced by the community,’ she says. 

One problem that the field must grapple 
with is howto define a bot, says Katrin Weller, 
aninformation scientist at the Leibniz Institute 
for the Social Sciences in Cologne, Germany. 
Not all bots are dispensing misinformation: 
some provide data from weather stations, 
or general news updates. Some researchers 
define Twitter bots as those accounts that send 
out more than a certain number of messages 
each day — a loose definition that could rope 
in prolific human tweeters. 

Other definitions are more complex, but 
bot detectors are locked in an arms race with 
bot developers. First-generation social-me- 
dia bots were relatively simple programs that 
retweeted others’ posts at regular intervals. 
Now, advances in machine learning have 
enabled the creation of more sophisticated 
bots that post original content. Some post at 
random intervals and mimic human patterns, 
such as not tweeting when a person would 
probably be asleep. Some developers will mix 
in human-generated content with automated 
content to better camouflage their bots. 

“Once you know more about the bots and 
how to detect them, then this knowledge 
is also available for the bot creators,” says 
Oliver Griibner, who studies quantitative 
health geography at the University of Zurich 
in Switzerland. “It’s a really tricky field.” 

Like Lehman, some social scientists are 
creating their own bots to conduct social 
experiments. Kevin Munger, a political scien- 
tist at Pennsylvania State University in Univer- 
sity Park, and his colleagues built bots that 
chided Twitter users who used racist language. 
One set of bots had profile pictures of white 
men; the other had profile pictures of black 
men. Munger found that Twitter users were 
more likely to tone down their racist rhetoric 
after being called out by bots with a white male 
profile picture’. 

After his Bieber Bot success, Lehmann 
designed more sophisticated bots to study 
how behaviours spread from one group to 
another. But bots now have such a bad repu- 
tation that he is leaning towards abandoning 
the approach, for fear of a public backlash. “1 
kind of thought: ‘I'll find another quiet cor- 
ner and do my research without courting 


controversy’,” he says. 
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Newsin focus 


How quickly can Iran 
make a nuclear bomb? 


News 


explainer 


With an international deal in jeopardy, Iran 
is not racing to build a nuclear weapon — 
but its capabilities are growing again. 


lran has accumulated 1,200 kilograms of 
enriched uranium — more than doubling 
the stockpile it had just three months ago, 
according to a statement from a senior 
official at the Atomic Energy Organization 
of Iran on 25 January. 

That’s enough to build one atomic bomb, 
if the uranium is further refined to make 
it weapons-grade — a process that could 
take just two to three months, says David 
Albright, a nuclear-policy specialist at 
the Institute for Science and International 
Security in Washington DC. Building actual 
weapons would take much longer, he adds. 

If confirmed, the rate of expansion of 
lran’s uranium stockpile “shifts things 
dramatically”, Albright adds. But he and 
others say that there is no evidence that 
Iran is rushing to build a bomb — for now. 

Tensions between Iran and the United 
States have been escalating. On 3 January, 
a US drone strike killed Qasem Soleimani, 
the architect of Iran’s military involvement 
in the Middle East. In response, Iran shot 
missiles at US bases in Iraq. 

The Joint Comprehensive Plan of Action 
(JCPOA), the 2015 deal between Iran and 
six global powers that limited its nuclear 
capabilities in exchange for the lifting of 
economic sanctions, is now in serious 
jeopardy. US President Donald Trump 
pulled out of the deal in May 2018, and Iran 
announced in May last year that it would 
resume uranium enrichment. 

Nature talked to nuclear experts to find 
out how soon Iran could build a bomb, and 
whether this is likely to happen. 


Has Iran tried to build nuclear weapons 
before? 

Building nuclear weapons is expensive 
and requires technical expertise in 
enriching uranium. The fissionable isotope 
uranium-235, which makes up less than 
1% of natural uranium, must be separated 
from uranium-238, which is by far the 
more common isotope. Iran has had an 
active nuclear programme for decades. 
The country has always maintained that 
this was purely for peaceful purposes, 


18 | Nature | Vol 578 | 6 February 2020 


such as producing isotopes for medical use. 
But in the early 2000s, Iran seemed to have 
a programme to build at least five uranium 
fission bombs, according to US intelligence 
assessments and international observers. 
Reports in the mid-2000s by the United 
Nations International Atomic Energy Agency 
(IAEA) suggested that Iran might have been 
actively working to build a nuclear arsenal. 
That would be a violation of the 1968 Treaty 
on the Non-Proliferation of Nuclear Weapons 
(NPT), which Iran has signed. In 2003, bowing 
to international pressure, the country agreed 
to cut down its nuclear activities drastically — 
but not completely. 


Did the 2015 deal reduce Iran's nuclear 
capabilities? 
By 2015, the country had stockpiled 11 tonnes 
of uranium hexafluoride enriched to as much 
as 20% *°°U — weapons-grade uranium must 
be enriched to 90%. Uranium hexafluoride 
in processed in for enrichment in gas form, 
in high-speed centrifuges, and in 2015 Iran 
had more than 10,000 of these centrifuges. 
When the JCPOA was signed in July that 
year, experts estimated that the country 
was months — perhaps weeks — away from 
producing weapons-grade uranium. 

But the JCPOA forced Iran to ship most 


of its stockpile abroad, and to mothball 

the majority of its centrifuges. The aim 

was partly to stretch the time Iran needed 

to stockpile enough fissile material for a 
bomb — known as ‘breakout time’ — to at 
least a year. The deal also subjected Iran to a 
stringent regime of IAEA inspections. 


What was the impact of the US withdrawal 
from the nuclear deal? 
Seyed Hossein Mousavian, who was a 


“Tensions between Iran 
and the United States 
have been escalating.” 


spokesperson for Iran’s nuclear negotiating 
team in 2003, says that Iranians feel cheated. 
The perception in the country is that “you 
cannot negotiate with or trust the US”, says 
Mousavian, now a nuclear-policy specialist at 
Princeton University in New Jersey. 


Does Iran now have enough enriched 
uranium to build nuclear bombs? 

Last November, the IAEA found that Iran 
had accumulated around 550 kilograms 
of uranium hexafluoride that was 


Iran’s Bushehr nuclear power plant has been in operation for almost ten years. 


© 2020 Springer Nature Limited. All rights reserved. 


MAJID ASGARIPOUR/AP/SHUTTERSTOCK 


“moderately enriched” to less than 4.5% 
288. It is unclear what material the Iranian 
official was referring to in his 25 January 
claim, but it is presumed to be 1,200 
kilograms of moderately enriched uranium 
hexafluoride. If further enriched, this could 
yield more than 30 kilograms of weapons- 
grade uranium, enough to build one fission 
bomb. 


How quickly could Iran make a bomb once 
it has enough weapons-grade uranium? 
Possessing fissile material is not enough: 

a country also has to master the design 

and manufacture of a bomb. In particular, 
uranium hexafluoride must be converted to 
uranium metal, which is not straightforward, 
says Richard Johnson, a proliferation 
specialist at the Nuclear Threat Initiative, a 
policy research centre in Washington DC. 
According to Albright, some intelligence 
agencies estimate that it could take Iran 
about two years to make its first two bombs if 
it wanted to do this. 


If the nuclear deal is scrapped, will Iran be 
legally entitled to go nuclear? 

No. Because Iran has signed the NPT, it is 
committed to using nuclear technology 
exclusively for peaceful purposes. 
Members of the NPT must allow the 

IAEA to verify their compliance, or face 
consequences. But Iran could withdraw 
from the NPT, as North Korea did in 2003, 
as it was becoming a nuclear power. 
Iranian foreign minister Mohammad Javad 
Zarif said on 20 January that the country 
is prepared to withdraw if its continued 
enrichment programme is reported to the 
UN Security Council. 


So is Iran actively working towards a 
nuclear bomb? 

“All the signs are that they are not,” says Zia 
Mian, a physicist and nuclear-policy expert 
at Princeton University. The country has 
complied with the rigorous IAEA inspection 
regime set out in the JCPOA. This means 
that a nuclear-weapons programme is 
“either hidden so well that no one has been 
able to find it so far, or that there is no 

such crash programme”, he says. Albright 
agrees, saying that Iran could be stockpiling 
enriched uranium to increase its leverage 

in future negotiations. “You don’t see some 
of the indicators that would imply a well 
worked-out decision” to actually build 
bombs, he says. But expanding stockpiles of 
enriched uranium brings the country closer 
to being able to make a nuclear bomb — if 
it wishes. 


By Davide Castelvecchi 


AFRICAN SCHIZOPHRENIA 
STUDY IDENTIFIES 
DAMAGING MUTATIONS 


Genetic studies of mental illness have largely been 
conducted in people with European ancestry. 


By Alison Abbott 


he first genomic analysis of schizo- 
phrenia in an African population has 
identified multiple rare mutations that 
occur more frequently in people with 
the condition. 

The mutations are mainly in genes that are 
important for brain development and the 
brain’s synapses, structures that coordinate 
communication between neurons. The find- 
ings, published on 31January (S. Gulsuner etal. 
Science 367, 569-573; 2020), match those of 
other schizophrenia studies — but nearly all 
previous research has been conducted in 
European or Asian populations. 

This research is important because Africa 
has represented a big gap in the populations 
that geneticists have studied, says psychiatric 
geneticist Andreas Meyer-Lindenberg, direc- 
tor of the Central Institute of Mental Health 
in Mannheim, Germany. He says that the work 
lends support to current hypotheses about the 
biological origins of schizophrenia, which can 
cause hallucinations and delusions. Research- 
ers think that each mutation contributes a 
small amount to the overall risk of developing 
the condition, and that disruption to synapses 
could be crucial to the disease’s development. 

Geneticists have long been criticized for fail- 
ing to sample diverse populations for genomic 
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studies, which have largely neglected African 
people. “This urgently needs more attention,” 
says Ambroise Wonkam, a human geneticist 
at the University of Cape Town, South Africa. 

This bias means that diagnostic tests and 
treatments developed on the basis of these 
studies might not workin certain populations. 
But studies in diverse populations also allow 
researchers to build up a fuller picture of dis- 
eases. In particular, African people as a group 
have genomes that are more diverse than those 
of other populations because the vast majority 
of human evolution took place in Africa. 

The study enrolled around 900 people with 
schizophrenia and a similar number of con- 
trols. All identified as Xhosa, members of an 
ethnic group who live mainly in South Africa. 

The researchers sequenced the participants’ 
genomes and searched for mutations that 
damage genes. Such mutations were much 
more prevalent in people with schizophrenia 
than in the control individuals, and were con- 
centrated in genes that are highly expressed 
in the brain or involved in synapse function. 

The results echo those of a large Swedish 
study (G. Genovese et al. Nature Neurosci. 19, 
1433-1441; 2016) that used the same methods, 
but the density of mutations in affected genes 
were generally larger inthe Xhosa participants. 
The authors say that this reflects the greater 
genetic variation among Xhosa people. 


Nature | Vol 578 | 6 February 2020 | 19 


CHRISTOPHER FURLONG/GETTY 


Feature 


THENEW 
3D PRINTING 


Researchers are developing techniques 
to print faster, bigger and weirder. 
By Mark Zastrow 
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sametal platform rises froma vat of 
liquid resin, it pulls an intricate white 
shape from the liquid — like a waxy 
creature emerging from a lagoon. 
This machine is the world’s fastest 
resin-based 3D printer and it can 
create a plastic structure as large as 
a person ina few hours, says Chad 
Mirkin, a chemist at Northwestern University in 
Evanston, Illinois. The machine, which Mirkin 
and his colleagues reported last October’, is 
one of aslew of research advances in 3D print- 
ing that are broadening the prospects of a 
technology once viewed as useful mainly for 
making small, low-quality prototype parts. 
Not only is 3D printing becoming faster and 
producing larger products, but scientists are 
coming up with innovative ways to print and 
are creating stronger materials, sometimes 
mixing multiple materials inthe same product. 


RELATIVITY SPACE 


A metal printer at start-up firm 
Relativity Space, which aims to test 
a mostly 3D-printed rocket this year. 


Sportswear firms, aviation and aerospace 
manufacturers and medical-device compa- 
nies are eager to take advantage. “You're not 
going to be sitting in your home, printing out 
exactly what you want to repair your car any 
time soon, but major manufacturing com- 
panies are really adopting this technology,” 
says Jennifer Lewis, a materials scientist at 
Harvard University in Cambridge, Massa- 
chusetts. 

The latest techniques could be lucrative 
for researchers, many of whom — Lewis and 
Mirkin among them — are already commer- 
cializing their work. They’re also fundamen- 
tally exciting, says lain Todd, a metallurgist 
at the University of Sheffield, UK. “We can get 


performance out of these materials that we 
didn’t think we could get. That’s what’s really 
exciting toa materials scientist. This is getting 
people used to the new weird.” 


From trinkets to products 


The 3D printing technique is also referred 
to as ‘additive manufacturing’, because 
instead of chopping or milling a shape out 
of a larger block, or casting molten material 
ina mould, it involves building objects from 
the bottom up. Its advantages include less 
waste and an ability to print custom designs, 
such as intricate lattice structures, that are 
otherwise hard to create. Low-cost hobbyist 
machines print by squeezing out thin plastic 
filaments from heated nozzles, building up a 
structure layer by layer — a method known as 
fused deposition modelling (FDM). But the 
term 3D printing encompasses a much wider 
range of techniques. One of the oldest uses 
an ultraviolet laser to scan across and solidify 
(or ‘cure’) light-sensitive resin, layer by layer. 
That concept was described as far back as 1984, 
ina patent filed by Charles Hull’, the founder 
of acompany called 3D Systems in Rock Hill, 
South Carolina. 

The latest techniques — including Mirkin’s 
— still use light-sensitive resin, but are faster 
and larger-scale, following improvements 
reported in 2015 by a team led by Joseph 
DeSimone, a chemist and materials scientist 
at the University of North Carolina at Chapel 
Hill’. Early printers were slow, small-scale and 
prone to producing layered, imperfect and 
weak structures. These founda niche in rapid 
prototyping, making plastic model parts as 
mock-ups for later production by conventional 
methods. As an area of research, this kind of 
printing wasn’t thrilling, says Timothy Scott, 
a polymer scientist at Monash University 
in Melbourne, Australia: “Basically making 
trinkets and knick-knacks. For a polymer 
chemist, it was pretty dull.” 

Then DeSimone unveiled a way to print 
light-sensitive resin up to 100 times faster 
than conventional printers’. It uses a stage 
submerged in a vat of resin. A digital projec- 
tor shines a pre-programmed image up at the 
stage through a transparent window in the 
floor of the vat. The light cures an entire resin 
layer at once. DeSimone’s advance was to make 
the window permeable to oxygen. This kills the 
curing reaction and creates a thin buffer layer, 
or ‘dead zone’, just above the window's surface 
so that the resin doesn’t stick to the bottom of 
the vat each time a layer is printed. The stage 
rises continually, pulling the completed part 
up through the liquid as new layers are added 
at the bottom. 

Other labs were working on similar 
concepts at the time, says Lewis. But perhaps 
most impressive about DeSimone’s resins was 
that they could undergo a second reaction in 
apost-print heat treatment to strengthen the 
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finished product. “It opens up a much broader 
array of materials,” says Lewis. 

Many research groups and firms have since 
built on the work. Mirkin’s printer pumps a 
layer of clear oil across the bottom of the vat 
to inhibit the polymer’s reactions. This also 
acts as a coolant, removing heat that can 
deform a printed part — and it means that 
the equipment is not limited to printing with 
resins that are inhibited by oxygen. He says 
the printer produces material ten times faster 
than DeSimone’s. And last January, Scott and 
his colleague Mark Burns at the University of 
Michiganin Ann Arbor reported a printer that 
inhibits the reactions by mixing into the resin 
a chemical that can be activated by a second 
lamp emitting a different wavelength of light’. 
By varying the ratio of the strength of the two 
light sources, the researchers can control the 
thickness of the photo-inhibited zone, allow- 
ing the creation of more complicated patterns, 
suchas surfaces embossed with seals or logos. 

Inventions in 3D printing often have rapid 
commercial potential: some researchers start 
forming companies before they publish their 


“We can get performance 
out of these materials that 
we didn’t think we could get.” 


advances. On the same day DeSimone’s paper 
was published, for instance, he showcased 
it at a TED talk in Vancouver, Canada, and 
officially launched his start-up firm Carbon 
3D in Redwood City, California, although 
he had quietly registered the company two 
years earlier. The firm is now one of the big- 
gest start-ups in 3D printing; it has already 
raised US$680 million in publicly disclosed 
funding rounds, and is reportedly valued at 
$2.4 billion. It has high-profile contracts with 
Adidas to make rubber-like midsoles for ath- 
letic shoes, and with sports-gear firm Riddell 
to manufacture customized helmet padding 
for American-football players. 

Mirkin and his colleagues James Hedrick and 
David Walker have also launched a start-up, 
Azul 3D in Evanston, Illinois, to commercial- 
ize their technique, which they have dubbed 
HARP (high-area rapid printing). And Scott and 
Burns are preparing acommercial prototype 
printer with their Ann Arbor-based start-up 
Diplodocal, aname derived from the Greek 
for ‘double beam’. 

New resin-printing techniques are still 
emerging. One begins with a small spinning 
glass holding liquid resin. As the glass rotates, 
a projector shines a loop of video onto it 
that corresponds to 2D slices of the desired 
object. Within seconds, the final object solid- 
ifies inside the liquid resin — no layers neces- 
sary’. The method is inspired by X-rays and 
computed-tomography scans, which image 


Nature | Vol578 | 6 February 2020 | 21 


Feature 


a cross-section of a solid object. This is the 
inverse: back-projecting cross-sections to 
forma 3D object. 

Even in this fast-moving field, the technique 
turned heads for what Lewis calls “the gee-whiz 
factor”. It has significant limitations: the resin 
used must be transparent, and the printed 
object must be small enough for light to pass 
through it to cure it. But it also has a potential 
advantage: it can handle highly viscous res- 
ins, which other resin-based printers struggle 
to suck through the narrow dead zone. That 
means it could make stronger materials and 
more accurate prints. 

The approach has garnered substantial 
interest from industry, says Christopher 
Spadaccini, a materials and manufacturing 
engineer at Lawrence Livermore National 
Laboratory (LLNL) in California. Spadaccini 
was a member of the team that published the 
idea last January’, although a group at the Swiss 
Federal Institute of Technology in Lausanne 
(EPFL) independently developed the same con- 
ceptand reported it ina preprint afew months 
beforehand®. Spadaccini thinks the technology 
has tremendous commercial potential because 
it has modest hardware requirements. “In the 
end, really, what you need is a halfway-decent 
projector and a rotating stage,” he says. 


Going big 

While chemists work on smarter ways to 
3D-print intricate resins, engineers are push- 
ing boundaries in 3D printing of concrete 
— using computers and robots to precisely 
automate the pouring process. 

The world’s first 3D-printed concrete 
pedestrian bridge was made by researchers 
at the Institute for Advanced Architecture of 
Catalonia in Barcelona, Spain, and installed 
ina parkin Alcobendas, near Madrid, in 2016. 
Twelve metres long, the bridge features a lat- 
tice structure designed with algorithms that 
maximize strength and reduce the amount 
of material needed. Other teams have made 
similar structures, including a26-metre-long 
bridge in Shanghai, China, produced by engi- 
neers at Tsinghua University in Beijing. And 
teams and companies in China and the Nether- 
lands have 3D printed demonstration houses. 

Those structures aren’t constructed in one 
print job, however: separate segments are 
printed and then connected. By producing 
bridges and houses more cheaply and effi- 
ciently, 3D printing could reduce concrete’s 
carbon footprint — but it could also just 
encourage engineers to build more. 

It’s not just concrete that is going big: 
Amsterdam firm MX3D has printed a bridge 
from stainless steel. First displayed publicly 
in 2018, the bridge is now being tested and 
having sensors installed ahead of a planned 
installation over an Amsterdam canal. And 
California start-up firm Relativity Space in Los 
Angeles says it is constructing a nearly fully 
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3D-printed rocket. The rocket is designed to 
lift 1,250 kilograms into low Earth orbit, and 
its first test launch is slated for the end of this 
year. Printed metal doesn’t always have the 
same heat-dissipating performance as non- 
printed metal, says Relativity Space’s chief 
executive, Tim Ellis, but the printing pro- 
cess can add cooling channels in geometries 
that can’t usually be manufactured. Because 
rockets are used only once or perhaps a few 
times, they don’t have to be as strong in the 
long term as do alloys in aeroplane parts, 
which must resist failure over tens of thou- 
sands of pressure cycles, Ellis says. 

These large-scale metal-printed projects are 
built with robot arms that feed a thin metal 
wire to a laser that welds the material into 
place. Other established ways to print metal 
use a laser or a beam of electrons to melt or 


“There are still some 
big challenges with 
4D printing.” 


Aresin printer from Chad Mirkin’s lab at 
Northwestern University in Illinois can create 
structures as large as a person in hours. 


fuse a bed of powder into layers of finished 
product. Another technique binds a bed of 
powder with liquid glue, then sinters the struc- 
ture ina furnace. And printers designed inthe 
past few years extrude molten metals through 
nozzles, in much the same way as in FDM. 

Aviation firms such as Boeing, Rolls Royce 
and Pratt & Whitney are using 3D printing to 
make metal parts, mainly for jet engines. It can 
be cheaper than milling metal blocks, and the 
intricate components often weigh less than 
their conventionally made counterparts. 

But 3D-printed metals are prone to defects 
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that can weaken the final products. Spadaccini 
and others are trying to use arrays of sensors 
and high-speed cameras to watch for irregu- 
larities suchas hotspots of heat or strain — and 
then make adjustments in real time, he says. 

Many scientists are also hoping to improve 
the intrinsic strength of printed metals, some- 
times by controlling the microstructures of 
the materials. For instance, in October 2017, 
a US team reported that the intense heat and 
rapid cooling used in 3D-printing stainless 
steel could alter the metal’s microstructure 
such that the product is stronger than those 
cast conventionally’. And two months ago, 
researchers in Australia and the United States 
reported atitanium-copper alloy with similar 
strength advantages®. As they solidified, previ- 
ous 3D-printed titanium alloys tended to form 
grains that grew in column-like structures. 
The copper helps to speed up the solidifica- 
tion process, which results in grains that are 
smaller and sprout in all directions, strength- 
ening the overall structure. 

Mark Easton, a materials engineer at RMIT 
University in Melbourne and one of the leaders 
of the alloy work, has already had conversa- 
tions with aerospace companies interested 
in exploring uses for the material. He says it 
could also be used in medical implants such 
as joint replacements. 

Many of the printing techniques that work 
for metals can also be applied to ceramics, 
with potential applications that include mak- 
ing dental crowns or orthopaedic implants. 
Moulds for these objects are already made 
by 3D printing, with the material cast in the 
conventional way. But 3D-printing the entire 
object could save time at the dentist or 
surgeon’s office. 

However, it is harder to control the 
microstructure of 3D-printed ceramics, says 
Eduardo Saiz, a materials scientist and cerami- 
cist at Imperial College London. And nearly all 
practical ceramic printing techniques involve 
extensive post-print sintering that can warp 
or deform the part. “In my opinion, ceramics 
is way behind polymers and metalsin terms of 
practical applications,” he says. 


Change over time 


The field’s future could also lie in ‘4D printing’ 
—3D-printed objects that also have the ability 
to perform some mechanical action, akin to 
artificial muscles. Often, these incorporate 
shape-memory polymers, materials that can 
react to changes in their environment suchas 
heat or moisture. 

In May 2018, researchers at the Swiss Fed- 
eral Institute of Technology (ETH) in Zurich 
and the California Institute of Technology 
in Pasadena reported printing a submarine 
that propels itself forward using paddles that 
snap backwards when placed in warm water”. 
The work could lead to microrobots that can 
explore the oceans autonomously. But for the 
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These multi-material print heads can switch between printing hard and soft materials in one object. 


moment, the paddles must be reset after each 
stroke. Such devices could use battery power to 
reset themselves, but that makes the machine 
less efficient than one made conventionally, 
says Geoff Spinks, a materials engineer at the 
University of Wollongong in Australia. “There 
are still some big challenges with 4D printing,” 
he says. 

Another approach to 4D-printed devices 
involves triggering the action with a chang- 
ing external magnetic field. US researchers 
have 3D-printed lattice structures filled with 
aliquid that changes stiffness in response to 
a magnetic field’® — which could perhaps be 
used to help car seats stiffen on impact. 

Other, more passive potential 4D printing 
applications include stents, which could be 
compressed to be implanted then expanded 
onreaching the desired site in a blood vessel to 
prop it open. LastJuly, researchers in Switzer- 
land and Italy described a4D-printed stent that 
is just 50 micrometres wide”, much smaller 
than conventional ones. The devices are so 
small, the team says, they could one day be 
used to treat complications in fetuses, such 
as Strictures in the urinary tract, which can 
sometimes be fatal. 

Perhaps the most ambitious example of 
4D printing is matter that not only moves, 
but is alive. Currently, techniques for such 
bioprinting can print tissue, such as human 
skin, that is suitable for lab research, as well 
as patches of tissue for livers and other organs 
that have been successfully implanted in rats. 
But such techniques are still far from ready 
to integrate into a human body. Researchers 
dream of printing fully functioning organs 


that could alleviate long wait lists for organ 
donors. “I personally feel we’re a decade-plus 
away from that, at least, if ever,” says Lewis. 


All together now 


Many inventive ideas about printing matter 
that moves or changes rely on printing 
multiple materials together. “That’s absolutely 
where the field is heading,” says Scott. 

Last November, Lewis and her lab described 
a printer that can rapidly switch between dif- 
ferent polymer inks or mix them as it prints 
a single object”. This means objects can be 
printed with both flexible and rigid parts. Lewis 
has spun off previous work on multi-material 
printers into a firm called Voxel8, a start-up in 
Somerville, Massachusetts. Her multi-material 
printer could help with the athletics wear that 
Voxel8 is developing, says Lewis. Wearable 
devices need to be flexible around joints while 
also having rigid parts to house electronics. 
Saiz calls the printer “beautiful work”, add- 
ing wistfully: “There’s nothing like that for 
ceramics or metal.” 

And in March 2018, a team led by Jerry Qi, 
a materials engineer at Georgia Institute of 
Technology in Atlanta, unveiled a four-in-one 
printer. This combines a nozzle that extrudes 
molten polymer with one that prints light-sen- 
sitive resin, ready to be cured by ultraviolet 
lamps or lasers, and two that print wires and cir- 
cuitry from tiny dots of metal”. The print heads 
work together to make integrated devices with 
circuits embedded on arigid board or insidea 
flexible polymer enclosure. Qisays his group is 
now collaborating with electronics companies 
interested in printing circuit-board prototypes 
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faster than conventional methods. 

It wasn’t as simple as bolting four different 
printers into one platform: the researchers 
also needed to develop software that would 
allow each print head to communicate withthe 
others and keep track of the progress. 

The field is still far from delivering on early 
visions of bringing mass manufacturing 
into people’s homes. For now, sophisticated 
printers are too expensive to appeal to 
non-specialists. But 3D printing has comea 
long way in the past 20 years. Todd remembers 
people touring his lab in the early 2000s to 
see his technique to fuse specks of metal dust 
together to grow parts. Compared with the con- 
ventional milling machines and metal-cutting 
systems in neighbouring labs, his 3D-printing 
machines struck visitors as acomplete oddity. 
“It was like we were some sort of a dog playinga 
pianoinabar,’ he recalls. Now, for many firms, 
that trick is standard practice. 


Mark Zastrow is a writer based in Seoul. 
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As an alternative to CRISPR, RNA editing 
could offer flexible, reversible therapies. 


By Sara Reardon 


horsten Stafforst found his big 
break at the worst possible time. In 
2012, his team at the University of 
Tubingen in Germany discovered 
that by linking enzymes to engi- 
neered strands of RNA, they could 
change the sequences of messenger 
RNA molecules in cells. In essence, 
they could rewrite the genome’s instructions 
en route to making proteins. 

The process could theoretically serve to treat 
numerous diseases, both ones with genetic 
underpinnings and those that would benefit 
fromachangeinthe amount or type ofaprotein 
being produced. But Stafforst had alot of trou- 
ble getting the discovery published — it was sim- 
ply not interesting any more. His finding! was 


overshadowed by the discovery a few months 
earlier that the DNA-editing tool CRISPR-Cas9 
could be used to permanently alter the genome. 

Since then, CRISPR has become a fixture in 
the laboratory and has spawned a number of 
companies aimed at using the technology to 
develop drugs and treatments. With CRISPR 
sucking up all the attention, Stafforst says, 
people reacted to his paper with indifference. 
They asked, “Why do weneed this when there’s 
DNA editing?” 

But CRISPR editing — at least as atherapeutic 
technique in people — has turned out to be 
more difficult than initially thought. Research- 
ers have documented ways that Cas9, one of 
the enzymes used in CRISPR gene editing, 
could trigger immune responses, or cause 
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accidental changes to the genome that would 
be permanent. RNA editing, by contrast, could 
allow clinicians to make temporary fixes that 
eliminate mutations in proteins, halt their 
production or change the way that they work 
in specific organs and tissues. Because cells 
quickly degrade unused RNAS, any errors 
introduced by a therapy would be washed 
out, rather than staying with a person forever. 

Excitement over RNA editing is finally 
catching on. In 2019, researchers published 
more than 400 papers on the topic, according 
to data from Scopus, an abstract and citation 
database. A handful of start-up companies 
are beginning to use RNA-editing systems to 
develop potential treatments for everything 
from genetic diseases such as muscular dys- 
trophy to temporary maladies such as acute 
pain. And although RNA-based drugs have 
had difficulty reaching the market owing to 
challenges in delivery and tolerance, some reg- 
ulatory approvals in the past few years might 
help to pave the way for RNA-editing therapies. 

Several hurdles remain: current technol- 
ogies can alter RNA sequences in only a few 
limited ways, and getting the system to work 
as intended in the human body will prove 
challenging. Still, researchers hope that new 
technologies, such as protein engineering, 
and improved methods for delivering RNA to 
cells can help to overcome these limitations. 
“It really opens a world we haven't seen before,” 
Stafforst says. 


Arole for RNA 


A foundational tenet in molecular genetics — 
its central dogma — was that cellular machin- 
ery faithfully transcribes genetic information 
from a double-stranded DNA template into a 
single-stranded RNA messenger, whichis then 
translated into a protein. But in the 1980s, a 
handful of labs noticed that some mRNA tran- 
scripts contained altered or extra letters that 
were not encoded inthe DNA. The findings were 
controversial until scientists uncovered a fam- 
ily of enzymes called adenosine deaminases 
acting on RNA (ADARs). These proteins bind 
to RNAs and alter their sequence by changing 
a familiar base known as adenosine into a mol- 
ecule called inosine. Although not one of the 
canonical RNA bases, inosine is read by the cell’s 
protein-translation machinery as the familiar 
guanosine. A handful of other RNA-editing 
enzymes surfaced around the same time. 
Scientists have struggled over the past three 
decades to understand what exactly RNA 
editing accomplishes. The editors work only 
on double-stranded RNAs, which sometimes 
showupinthe cell as regulatory elements — or 
as viruses. Some have speculated that the ADAR 
proteins evolved as a defence against viruses, 
but many viruses with double-stranded RNA 
are unaffected by the enzymes. The editing 
might serve a regulatory function, but most 
adult tissues don’t produce the high levels of 
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the proteins required for the editing to occur. 

Brenda Bass, a biochemist at the Univer- 
sity of Utah in Salt Lake City, was among the 
first to identify ADARs in frog embryos’. She 
says that no one has found a specific role for 
the changes made to non-protein-coding 
RNAs, which account for the majority of 
edited molecules. The editing could serve 
to protect double-stranded RNAs from 
immune attack. Bass suspects that ADARs 
edit the double-stranded transcripts, adding 
inosines as a way of telling the body to leave 
them alone. The enzymes also seem to have 
arole in embryonic development: mice that 
lack ADAR genes die before birth or don’t live 
long after. The editors also seem to have some 
function in select tissues of adult organisms 
— suchas the nervous system of cephalopods. 

Itwas this activity that drew marine biologist 
Joshua Rosenthal to RNA editing in the early 
2000s. It seems that highly intelligent ceph- 
alopods, such as squid, cuttlefish and octo- 
puses, use RNA editing extensively to adjust 
genes involved in nerve-cell development 
and signal transmission. No other animals are 
known to use RNA editing in this way. Inspired 
by these observations, Rosenthal wondered 
whether it was possible to use the system to 
correct the messages produced by dysfunc- 
tional genes in a therapeutic setting. In 2013, 
his group at the University of Puerto Rico in 
San Juan re-engineered ADAR enzymes and 
attached them to guide RNAs that would bind 
toaspecific pointinan mRNA — creating a dou- 
ble strand. With these, they were able to edit 
transcripts in frog embryos, and evenin human 
cells in culture’. 

Similar to Stafforst, Rosenthal, now at the 
Marine Biological Laboratory in Woods Hole, 
Massachusetts, saw his publication mostly 
ignored. A similar fate, he learnt, had befallen 
the work of researchers at a company called 
Ribozyme, who in 1995 proposed ‘therapeu- 
tic editing’ of mutated RNA sequences by 
inserting complementary sequences into 
frog embryos and allowing ADARs to edit the 
resulting double-stranded molecule and cor- 
rect the mutation‘. 

Butinthe past several years, multiple factors 
have converged to bring Rosenthal’s and Staf- 
forst’s findings to the fore. Peter Beal, achemist 
at the University of California, Davis, says that 
the 2016 publication’ of the molecular structure 
of ADAR bound to double-stranded RNA made 
the system more understandable and enabled 
scientists to better engineer the enzyme to 
enhance its delivery or make it more efficient. 
And in 2018, the US Food and Drug Adminis- 
tration (FDA) approved the first therapy using 
RNA interference (RNAi): a technique in whicha 
small piece of RNA is inserted into a cell in which 
it binds to native mRNAs and hastens their deg- 
radation. The approval has opened the door for 
other therapies that involve mRNA interactions, 
says Gerard Platenburg, chiefinnovation officer 
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THE RNA CORRECTIONS 


Several RNA-editing proteins can work on 
double-stranded RNAs, including the messenger 
RNAs that encode proteins. They can make very 
specific single-letter changes: an enzyme called 
ADAR changes the base known as adenosine into an 
inosine molecule (which protein-producing 
machinery reads as a guanosine). 
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A protein called APOBEC1 can change cytosine into 
uracil. Researchers can use this enzyme to change 
the sequence of the resulting protein or add and 
remove sequences that prematurely halt the 
production of a protein. 
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of ProQR Therapeutics in Leiden, the Nether- 
lands, which is pursuing various RNA-based 
therapies. “Learning from the past, and with 
the number of approvals picking up, the field 
has matured alot,’ says Platenburg. 

Many see RNA editing as an important alter- 
native to DNA editing using techniques such as 
CRISPR. CRISPR technology is improving, but 
DNA editing can cause unwanted mutationsin 
other parts of the genome — ‘off-target effects’ 
— which might create new problems. 

Rosenthal expects, moreover, that RNA 
editing will prove useful for diseases without 
a genetic origin. He is currently using ADARS 
to edit the mRNA for a gene encoding the 
sodium channel Nav1.7, which controls how 
pain signals are transmitted to the brain. Per- 
manently changing the Nav1.7 gene through 
DNA editing could eliminate the ability to feel 
pain and disrupt other necessary functions of 
the proteinin the nervous system, but tuning it 
downthroughRNaA editing in select tissues for 
alimited amount of time could help to alleviate 
pain without the risk of dependency or addic- 
tion associated with conventional painkillers. 

Similarly, RNA editing could allow research- 
ers to mimic genetic variants that providea 
health advantage. For example, people with 
certain mutations in the gene PCSK9, which 
regulates cholesterol in the bloodstream, tend 
to have lower cholesterol levels, and modifying 
PCSK9 mRNA could confer a similar advantage 
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without permanently disrupting the pro- 
tein’s other functions. Immunologist Nina 
Papavasiliou of the German Cancer Research 
Center in Heidelberg says that RNA editing 
could be used to fight tumours. Some can- 
cers hijack important cell-signalling pathways, 
suchas those involved in cell death or prolifer- 
ation. If RNA editors could be conscripted to 
turn off key signalling molecules temporarily, 
she says, “we could see the tumour die”. Then, 
the patient could stop the therapy, allowing 
the pathway to resume its normal functions. 

As a treatment, RNA editing might be 
less likely to cause a potentially dangerous 
immune reaction than are CRISPR-based 
approaches. Unlike the DNA-editing enzyme 
Cas9, which comes from bacteria, ADARs are 
human proteins that don’t trigger an attack 
from the immune system. “You really don’t 
need heavy machinery to target RNA,” says 
Prashant Mali, a bioengineer at the University 
of California, San Diego. 

Ina paper published last year®, Mali and his 
colleagues injected guide RNAs into mice born 
with a genetic mutation that causes muscular 
dystrophy. The guide RNAs were designed to 
trigger production of a missing protein called 
dystrophin. Although the system edited onlya 
smallamount of the RNA encoding dystrophin, 
it restored the protein to about 5% of its normal 
level in the animals’ muscle tissue, an amount 
that has shown therapeutic potential. 

In other diseases that result from a missing 
or dysfunctional protein, such as some types 
of haemophilia, “it makes a huge difference 
to go from nothing to something”, Stafforst 
says, and it might not benecessary to edit RNA 
in every cell in the body. RNA editing might 
perform better than forms of gene therapy 
that would involve injecting a new gene. Mali 
and others say that directing native ADARs to 
operate onthe cell’s own mRNA might provide 
amore natural response than introducing an 
external, engineered gene. 

RNA-editing technology is far from perfect, 
however, even when it comes to laboratory 
applications. “It is early days,” Bass says. 
“There’s lots of questions.” Because ADARs are 
much less efficient than CRISPR, they could be 
less useful for making genetically modified 
plants and animals. “As a research tool, it’s 
very limiting,” says Jin Billy Li, a geneticist at 
Stanford University in California. 

Another major disadvantage is that ADARs 
can make only a few kinds of change to RNA. 
CRISPR systems act as scissors by cutting DNA 
at a designated spot and removing or insert- 
ing anew sequence; ADARs are more like an 
overwrite function that changes letters chem- 
ically, without breaking the RNA molecule’s 
‘backbone’. 

Although this process is less likely to cause 
unintended mutations, it limits the enzymes 
to making specific changes — adenosine to 
inosine in the case of ADARs, and cytosine to 


uridine by a set of enzymes called APOBECs 
(see ‘The RNA corrections’). There are a few 
other possibilities. Grape plants, for instance, 
can change cytidines to uridines, and some 
tumours can change guanosines to adeno- 
sines. “Biodiversity is giving us tons of answers 
to these things,” Rosenthal says. “I think down 
the line, things like the squid are going to teach 
us alot.’ But he says the field is understudied — 
researchers don’t understand the process that 
drives this editing. And it remains to be seen 
whether a plant enzyme, for instance, could 
function in human cells. 

Scientists are already looking for ways to 
engineer new enzymes that could expand 
RNA- editing capabilities. “It’s quite a process 
where you don’t know what you'll find,” says 
Omar Abudayyeh, a biological engineer at the 
Massachusetts Institute of Technology (MIT) 
in Cambridge. Working with Feng Zhang, a 
CRISPR pioneer at MIT, Abudayyeh and his 
colleagues linked an ADAR enzyme to Cas13 
(ref. 7). A bacterial enzyme similar to the 
CRISPR-associated protein Cas9, Cas13 cuts 
RNA instead of DNA. The researchers altered 
the sequence of the ADAR until it could con- 
vert cytidines to uridines. They then used the 
new system in human cells to change bases in 
mRNAs encoded by several genes, including 
APOE. One naturally occurring genetic vari- 
ant of this gene is associated with Alzheimer’s 
disease, and editing it could switch the variant 
to the harmless form. 

Abudayyeh and his MIT collaborator, bio- 
logical engineer Jonathan Gootenberg, admit 
itis possible that changing the ADAR protein 
could cause the immune system to stop rec- 
ognizing it as a natural human protein and 


attack cells that contain it. But they say that 
because these edits are small, this risk pales 
next to known concerns about the immune 
system attacking Cas13 or the virus used to 
deliver the editing tools into cells. 

Researchers see promise ina natural process 
called pseudouridylation, in which a set of pro- 
teinand RNA enzymes chemically modify the 
structure of uridines in mRNA. Unlike ADAR 
modifications, pseudouridylation doesn’t 
change the sequence ofthe mRNA or protein. 
Instead, for reasons that are not entirely clear, 
the process stabilizes the RNA molecule and 
causes the translation machinery to ignore 
signals instructing it to stop making protein. 

The ability to turn these molecular red lights 
into green lights could be powerful. Yi-Tao Yu, 
a biochemist at the University of Rochester in 
New York, says that hundreds of genetic dis- 
eases are caused by DNA mutations that cre- 
ate incorrect stop signals in mRNAs, resulting 
ina shortened protein that doesn’t function 
normally in the body. “The list is very long,” Yu 
says, and includes cystic fibrosis, the eye dis- 
ease Hurler’s syndrome and numerous cancers. 

Despite its early stage, researchers — and 
biotechinvestors — are excited about the wide 
potential of RNA editing. “I got into it way 
before it became cool,” says Papavasiliou, who 
is trying to map where natural ADARs work in 
the body. “For many years this was a backwater, 
and all of asudden there’s acompany popping 
up every two weeks.” 

Numerous start-ups and established 
DNA-editing firms have announced their inten- 
tion to move into RNA. They include Beam 
Therapeutics in Boston, Massachusetts, which 
was co-founded by Zhang and Liu and has been 
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developing CRISPR DNA editing as a therapy 
for several blood diseases. Locana, based in 
San Diego, is also pursuing CRISPR-based 
RNA editing that it hopes could treat condi- 
tions including motor-neuron disease and 
Huntington’s disease. 

The challenge for industry is to work out 
the best way to get the guide RNAs into the 
cell without triggering an immune reaction 
or causing the cell to degrade them. Beal says 
that this could include making strategic chem- 
ical modifications to the engineered RNAs that 
stabilize them, or embedding them inanano- 
particle or virus that can sneak into cells. 

And although ADARs are already in human 
cells, the human body makes only small 
amounts of them in most tissues, meaning that 
any therapy might need to add ADARs or other 
enzymes to boost cells’ editing capabilities. 
Packing viruses with the genes that encodeall 
the machinery needed for RNA editing might 
not be efficient. Many hope that it won’t be 
necessary. 

Platenburg hopes to add RNAs and rely 
on the naturally occurring ADARs to help to 
correct the lettering of mRNAs that contribute 
to retinal disorders. “We use the system given 
to us by nature and harness it,” he says. 

Researchers including Stafforst are engineer- 
ing guide RNAs with chemical modifications 
that attract ADARsin the cell to the editing site. 
But some researchers worry that conscripting 
the natural ADARs into editing specific mRNAs 
could pull them away from their normal tasks 
and cause other health problems. Altering 
gene expression in one part of the body could 
affect other parts in unforeseen ways. In Mali’s 
muscular-dystrophy study, for instance, mice 
developed liver problems for unknown reasons. 
“It’s atoolin development still,’ he says. 

“ADAR evolved to allow the body to mod- 
ify bases in a very targeted fashion,” says 
Nessan Bermingham, chief executive and a 
co-founder with Rosenthal and others of bio- 
technology company Korro Bioin Cambridge, 
Massachusetts. Bermingham is optimistic 
about the prospects of RNA editing, but cau- 
tious not to get ahead of the biology. “We have 
alot of work to do as we start to mature these 
techniques,” he says. “We’re not leaving any- 
thing off the table, but we have to recognize 
certain limitations.” 


Sara Reardon is a freelance journalist in 
Bozeman, Montana. 
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Sugar: multinationals have used the tobacco-industry playbook to stymie legislation aimed at cutting consumption. 


Truth decay: when 
uncertainty is weaponized 


From tobacco to food and fuels, industries use denial, 
deceit and doubt to corrupt. By Felicity Lawrence 


n2017, US presidential strategist Kellyanne 

Conway coined the phrase “alternative 

facts” to defend false claims about the size 

of the crowd at Donald Trump’s inaugura- 

tion. Numerous commentators lamented 
that we were entering a new era of Orwellian 
doublethink. 

These are indeed upside-down times, as 
epidemiologist and former safety regulator 
David Michaels demonstrates in his excoriat- 
ing account of the corporate denial industry, 
The Triumph of Doubt. Unwelcome news is 
automatically rebranded fake news. Inconven- 
ient evidence from independent sources — say, 
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about climate breakdown and fossil fuels, or 
air pollution and diesel emissions — is labelled 
junk science and countered with rigged studies 
claiming to be sound. 


The Triumph of 
Doubt: Dark Money 
and the Science of 
Deception 

David Michaels 
Oxford Univ. Press 
(2020) 
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But it would be wrong to see truth decay 
solely as the preserve of today’s populist 
politicians. Normalizing the production of 
alternative facts is a project long in the making. 
Consultancy firms that specialize in defending 
products from tobacco to industrial chemicals 
that harm the public and the environment 
have made a profession of undermining truth 
for decades. They hire mercenary scientists 
to fulfil a crucial role as accessories to their 
misrepresentations. 


Denial machine 


Michaels was among the first scientists to 
identify this denial machine, in his 2008 
book Doubt is Their Product. His latest work 
combines an authoritative synthesis of 
research on the denial machine published 
since then with his own new insights gleaned 
from battles to control the toxic effects of a 
range of substances. He takes on per- and poly- 
fluoroalkyls, widely used in non-stick coatings, 
textiles and firefighting foams; the harmful 
effects of alcohol and sugar; the disputed role 
of the ubiquitous glyphosate-based pesticides 
in cancer; and the deadly epidemic of addic- 
tion to prescribed opioid painkillers. In each 
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case, Michaels records how the relevant 
industry has used a toolbox of methods to 
downplay the risks of its products, spreading 
disinformation here, hiding evidence of harm 
there, undermining authorities — all tactics 
from the tobacco industry’s playbook. 

The doubt in the title of both Michaels’s 
books derives from a now-notorious memo 
written in 1969 by an unnamed executive at 
a subsidiary of British American Tobacco. It 
outlined a strategy for maintaining cigarette 
sales: “Doubtis our product since it is the best 
means of competing with the ‘body of fact’ that 
exists in the minds of the general public. It is 
also the means of establishing a controversy.” 
By creating scientific disinformation about 
links between tobacco and disease, this malign 
strategy delayed regulation by decades and 
protected corporate profits. 

Michaels’s insider perspective on the doubt 
machine dates back to 1998, when he became 
chief safety officer for nuclear-weapons 
facilities at the US Department of Energy 
during the administration of US president 
Bill Clinton. Here, he had a ringside view of 
the tricks used by vested interests to dispute 
established science, intimidate the authorities 
and scupper regulation. In his first book, he 
described how the ‘product defence industry’ 
applied the tobacco template to asbestos, 
lead, plastics and toxic materials such as 
beryllium used in nuclear applications. 

From 2009 to 2017, Michaels served as a 
senior regulator, appointed by president 
Barack Obama, in the Occupational Safety 
and Health Administration (OSHA). Here, he 
gathered even more material to show how 
deceptions have infected the body politic. 


Subverting the method 
The principles of scientific inquiry involve 
testing a hypothesis by exploring uncertainty 
around it until there is a sufficient weight of 
evidence to reach a reasonable conclusion. 
Proof can be much longer in coming, and 
consensus still longer. The product-defence 
industry subverts these principles, weaponiz- 
ing the uncertainty inherent in the process. 
Its tricks include stressing dissent where little 
remains, cherry-picking data, reanalysing 
results to reach different conclusions and 
hiring people prepared to rig methodologies 
to produce funders’ desired results. 
Michaels acknowledges other doubt 
scholarship. This includes that of science 
historians Naomi Oreskes and Erik Conway 
in Merchants of Doubt (2010); nutritional 
scientist Marion Nestle’s numerous books on 
the food industry, suchas Soda Politics (2015) 
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Aman prays in foam caused by pollution in the Yamuna River in New Delhi. 


and Unsavory Truth (2018); and journalist 
Jane Mayer’s 2016 Dark Money. That last book 
traced the funding that links climate-change 
denial to the libertarian right’s ideological 
drive to shrink the state and deregulate 
industry. 

Michaels names names fearlessly, pointing 
the finger at product-defence practitioners 
and the front groups and think tanks that 
masquerade as independent while taking 


“Creating scientific 
disinformation delayed 
regulation by decades and 
protected corporate profits.” 


industry’s shilling. Those wanting to check 
his allegations can find many previously 
unavailable source documents archived at 
the Triumph of Doubt Special Collection 
at https://toxicdocs.org. 


Emissions cheats 

So much of his material outrages, but two 
episodes stand out. Oneis the German car man- 
ufacturer Volkswagen’s brazen malfeasance 
regarding its diesel engines. The company 
developed secret software so these engines 
could cheat emissions tests, allowing its vehi- 
cles to fraudulently pass stringent US checks 
on the disease-causing particulates in diesel 
exhaust. This was unintentionally uncovered 
in 2014 by students working on behalf of the 
campaign group the International Council 
on Clean Transportation in Washington DC, 
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and confirmed the following year by the US 
Environmental Protection Agency. Michaels’s 
account of the scientists prepared to launder 
their data to abet this criminal activity is 
forensic. 

The second standout is his description 
of his years-long battle at the OSHA to 
reduce workers’ exposure to silica parti- 
cles from sand used in dozens of industries, 
from construction to steel manufacture and 
fracking. He and at least 50 staff members 
worked to collate evidence and counter a 
barrage of pseudoscientific objections and 
litigation. 

Michaels, no longer required to be a 
non-partisan government official, reserves 
special criticism for the Republican Party. He 
argues that corporate polluters and manu- 
facturers of dangerous products have long 
depended on the party to neuter public-health 
and regulatory agencies with phoney rheto- 
ric about liberty and free-market enterprise. 
He wants stronger regulation, not because 
he does not care about freedom, he says, but 
because we cannot be free without the state’s 
protection from harm. 

The Triumph of Doubtis at times dense with 
technical detail, of necessity as Michaels pros- 
ecutes his case against companies known 
to be litigious. It is a brave and important 
book, raising the alarm about the systemic 
corruption of science. 


Felicity Lawrence is special correspondent for 
The Guardian in London and author of Not on 
the Label and Eat Your Heart Out. 

e-mail: felicity.Lawrence@theguardian.com 
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Genomics: data sharing needs 
an international code of conduct 


Mark Phillips, Fruzsina Molnar-Gabor, Jan O. Korbel, Adrian Thorogood, Yann Joly, 
Don Chalmers, David Townend & Bartha M. Knoppers for the PCAWG Consortium. 


Efforts to protect people’s 
privacy ina massive 
international cancer project 
offer lessons for data sharing. 


ore than 800 terabytes of genomic 
data are available to investigators 
all over the world, thanks to a major 
international project to identify the 
genetic traits associated with various 
types of cancer. Researchers involved have just 
published six papers in Nature. (Another 16 
papers have been published elsewhere.) 

All eight of us were involved in the six- 
year endeavour. And four of us helped put 
in place safeguards to protect the privacy 
of the thousands of patients and volunteers 


who consented to have their data used in 
the research. Here, we reflect on some of the 
lessons learnt for researchers sharing vast 
amounts of genomic data. 

Genomics researchers worldwide are increas- 
ingly dealing with vast data sets gathered by 
consortia spanning many countries. Most 
are unclear on what to do to protect people’s 
privacy and to comply with international and 
national data-protection laws, especially given 
recent and ongoing changes in legislation. 

Aninternational code of conduct for genomic 
data is nowcrucial. Built bythe genomics com- 
munity, it could be updated as technologies and 
knowledge evolve more easily than is possible 
for national and international legislation. 


Inthe clouds 


Between 2013 and 2019, 468 institutions 
from 34 countries in Asia, Australasia, Europe 
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and North America amassed 2,658 cancer 
genomes — each paired with anon-cancerous 
sequence from the same person. The effort 
was led by the International Cancer Genome 
Consortium (ICGC). 

The combined data were made available 
to investigators largely thanks to cloud com- 
puting. The project — the Pan-Cancer Analysis 
of Whole Genomes (PCAWG) — is the first to 
try to aggregate so many subprojects across 
different jurisdictions and make the entire 
data set available across the world. 

Much of the PCAWG data (and the tools for 
analysing them) were made available through 
the Cancer Genome Collaboratory, a cloud 
service built for the genomic research commu- 
nity. (The commercial cloud-service provider 
Amazon Web Services was also used.) But the 
data were first processed in high-performance 
computer centres and the clouds of academic 
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Server racks in a centre in Berlin. 


institutions inGermany, the United Kingdom, 
the United States, Canada, Spain, Japan and 
South Korea. Some were also processed using 
commercial clouds (Amazon Web Services, 
Microsoft Azure and Seven Bridges). 

Since PCAWG began, several other interna- 
tional genomics projects have turned to the 
cloud (see ‘Cashing in on clouds’) including 
The Human Cell Atlas, an international pro- 
ject to create a reference map of all human 
cells’, and the European Open Science Cloud, 
which is for researchers and professionals 
in science, technology, the humanities and 
social sciences’. 

This trend is likely to continue. Cloud 
services are becoming cheaper and more read- 
ily available, and researchers are increasingly 
reaping the benefits of sharing ever-larger 
amounts of genomic data with international 
colleagues? (see ‘Open data’). 

Yet cloud services bring fresh challenges 
with respect to the protection of participants’ 
data — especially given that national govern- 
ments, law enforcement and private corpo- 
rations are increasingly showing interest in 
accessing them. Canadian border authorities, 
for example, are choosing which country to 
deport migrants to on the basis of DNA test 
results from consumer genomic services’. 


Long-term continuity 
Organizations suchas the Cancer Genome Col- 
laboratory can persist only for as long as they 
are funded. Even in the case of Amazon and 
other major tech companies, service outages 
caused by technical problems, changes tothe 
company’s terms of service or even sudden 
closure of the company could block research- 
ers’ access to data at any time. Also, it is often 
unclear to what extent researchers using cloud 
services can ensure that their data are not dis- 
closed to third parties, such as those conduct- 
ing abusive state-level surveillance. Nor is it 
clear what steps must be taken to protect the 
data against such breaches of confidentiality. 
Inthe case of PCAWG, the ICGC’s Data Access 
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Compliance Office helped to guard against 
some of these issues. Anyone wanting to use 
PCAWG data entered into a contract with the 
project’s data-access committee; they had to 
confirm, for instance, that they would not try 
tore-identify patients or volunteers once these 
people’s data had been stripped of personal 
information. No breach of donor confidenti- 
ality is known to have occurred. 

But even when researchers request the 
data from an associated data-access commit- 
tee (as in the case of PCAWG and elsewhere), 
numerous issues remain unresolved. It is 
unclear, for instance, what vetting should 


Cashingin 
on clouds 


Cloud services have been transformative 
in enabling large-scale genomic analysis. 


Conventionally, any research team wanting 
to analyse an aggregate data set collected 
by a consortium would first have to seek 
authorization from each project partner’s 
research ethics or data-access compliance 
office. It would then have to download 

the data from each subproject over the 
Internet or — more likely — have the hard 
drives containing the data sent by post. 

In the case of the Pan-Cancer Analysis of 
Whole Genomes, which comprises 800 
terabytes of raw data, investigators were 
able to save months, and thousands of 
dollars, by immediately accessing the 

data they needed, and experimenting 

with and customizing the analytical tools 
developed by the community. They could 
also obtain authorization to access most of 
the data from one place — the International 
Cancer Genome Consortium’s Data Access 
Compliance Office. M.P. et al. 
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occur before researchers get access to sensi- 
tive genomic data, or what checks should be 
made before genomic data are shared inter- 
nationally. Even those involved in PCAWG 
could not establish a truly international 
cloud because of restrictions on the transfer 
of data across borders (caused, in this case, 
by European regulators having concerns 
about genomic data from Europeans being 
held inthe United States). 

The US component of the project (The Can- 
cer Genome Atlas; TCGA), which contributed 
one-third of all PCAWG samples, was made 
available to researchers through the Uni- 
versity of Chicago’s Protected Data Cloud. 
Researchers wanting to obtain those data 
had to abide by an access agreement that was 
largely compatible with that provided by the 
ICGC’s Data Access Compliance Office. Ulti- 
mately, however, TCGA remained conceptually 
split off from the rest of the project, because 
researchers had to follow a different access 
procedure and to combine the two data sets 
themselves. 


Code of conduct 


Genomics researchers urgently need clear 
data-sharing rules that are harmonized across 
jurisdictions. 

Aninternational code of conduct could help 
investigators to overcome some of the current 
hurdles, as well as others that might arise as 
legislation on data protection evolves. 

Such a code could outline the steps 
researchers must take to comply with the 
European Union’s General Data Protection 
Regulation (GDPR), implemented in 2018, 
and the US Health Insurance Portability and 
Accountability Act, among other laws. In fact, 
the GDPR explicitly encourages the develop- 
ment of sector-specific data-protection codes 
inits Article 40. And last June, the European 
Data Protection Board (an independent body 
tasked with issuing guidance onthe GDPR and 
encouraging the drawing-up of codes of con- 
duct), issued guidelines on the submission, 
approval and monitoring of such codes for 
data processing. It also promised further guid- 
ance onthe use of codes as a potential way to 
facilitate the transfer of data across borders 
(see go.nature.com/322nkky). 

A European biobanking research infrastruc- 
ture, knownas BBMRI-ERIC, announced in 2017 
that it would develop an EU-wide Code of Con- 
duct on Health-Related Data, to submit tothe 
European Commission (see go.nature.com/ 
2j3ihce). When completed, and if approved, 
such a European code could be beneficial. 
Meanwhile, we call on the genomics research 
community to prioritize the establishment 
of an international code of conduct that lays 
out how existing ethical and legal obligations 
can be satisfied in relation to international 
genomic clouds. 

At least five aspects must be considered. 
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Identifiability. Despite the problems with 
it’, de-identification, in which health data 
are stripped of any information that could be 
used to identify the participant (suchas name, 
social security number, address) has long been 
hailed as a way to protect people’s privacy in 
research®. Yet because of conflicting terminol- 
ogy and gaps in understanding, researchers 
rarely know what standard they must meet 
for their data to be properly anonymized or 
‘pseudonymized’ (in whicha code enables indi- 
viduals to be re-identified)’. What’s more, laws 
are difficult to enforce in practice because it is 
often unclear how breaches of confidentiality 
occurred, or which organization or researcher 
was responsible®”. 

Data-protection laws, such as the GDPR or 
the California Consumer Privacy Act, invar- 
iably require the identifiability of data to 
be analysed on a case-by-case basis, in part 
because the technological tools enabling 
identification are constantly changing. Even 
though it is difficult to lay out hard and fast 
rules in advance, a code could provide some 
guidance on how to evaluate when it is rea- 
sonable to deposit genomic (and health data 
more broadly) in open-access repositories. 
This might involve considering, say, whether 
aset of genomic variants is somatic or present 
inthe germline and so inherited. (Researchers 
have shown that it is possible to identify an 
individual using only afew germline variants”; 
no one has yet been able to identify someone 
onthe basis of somatic tumour variants.) 


Broad consent. The GDPR explicitly 
recognizes an exception to ‘specific consent’, 
meaning the consent people give for their data 
to be used ina specific research project. This 
is to allow participants’ data to be used for 
certain areas of scientific research, in keep- 
ing with recognized ethical standards". Guid- 
ance is needed on what researchers must do 
to meet the requirements for broad consent. 
Furthermore, how should they keep patients 
and volunteers informed about how their data 
are ultimately used? 


Return of individual findings, portability and 
access. Howto safeguard participants’ right to 
move their data around - by giving them their 
data ina machine-readable format, rather than 
as a printed PDF, for example — should be clari- 
fied. The code could also lay out what steps are 
necessary for responsible communication of 
health data toa patient or volunteer”. Should 
people whoare being informed about the iden- 
tification of genomic variants of malignant 
or unknown significance be offered genetic 
counselling, for instance? 


Withdrawal. Researchers need guidance 
on how they can meet participants’ right to 
withdraw from research. The GDPR requires 
that those entrusted with people’s data keep 


Open data 


Over the past three decades, geneticists 
around the world have been sharing 
more and more data. 


Last year, more than 83,000 researchers 
from 146 countries downloaded 

6.7 petabytes of (mainly human) DNA data 
from the European Molecular Biology 
Laboratory’s European Bioinformatics 
Institute. This hosts many biological 

data sets and makes them accessible 
worldwide. That is equivalent to around 
230 billion whole human genomes. 

Such sharing of genomic data will only 
increase as more data become available. 
By 2025, more than 60 million patients 
worldwide are expected to have had their 
genome or exome (protein-coding regions) 
sequenced as part of routine health care” 
— potentially providing a formidable 
resource for researchers. M.P. et al. 


records of third parties to whom they have 
disclosed those data. And when consent is 
revoked, they must notify the third parties. Yet 
allsorts of questions remain, suchas whether 
analyses on aggregate data should be revised 
with the participants’ data removed, and soon. 


Compelled disclosure. A code of conduct 
could provide researchers with guidance on 
how to deal with government requests for per- 
sonal data, including what legal protections 


“Researchers need guidance 
on howthey can meet 
participants’ right to 
withdraw from research.” 


they can appeal to. In the United States, for 
example, the National Institutes of Health’s 
Certificates of Confidentiality are designed to 
shield researchers from such requests”. 


Next steps 


The achievements of PCAWG in relation to the 
sharing and handling of genomic data augur 
well for the development of an international 
code that researchers everywhere can refer to. 

Genomic research consortia, public and pri- 
vate funding bodies, and those working on exist- 
ing regional codes (such as the one in Europe) 
might begin the process of building it. A first 
step would be to convene a meeting to deter- 
mine the topics the code would touch on, the 
best way to consult research participants about 
their needs anda decision-making process that 
will allow the text to be finalized inatimely way. 
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If genomics researchers are instead left in 
the dark about howto properly address data 
protection and sharing, they could either be 
excessively cautious and fail to share as con- 
sents allow, or fail to provide participants 
with appropriate protection”. In other words, 
further regulatory uncertainty risks stalling 
new genomic analyses and undermining peo- 
ple’s faith in scientific collaboration for the 
public good. 
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A migrant farm worker has her fingerprints scanned so that she can register for a national identity card in India. 


The long road to 
fairer algorithms 


Matt J. Kusner & Joshua R. Loftus 


Build models that identify 
and mitigate the causes of 
discrimination. 


n algorithm deployed across the 

United States is now known to 

underestimate the health needs of 

black patients’. The algorithm uses 

health-care costs as a proxy for health 
needs. But black patients’ health-care costs 
have historically been lower because systemic 
racism has impeded their access to treatment 
— not because they are healthier. 

This example illustrates how machine 
learning and artificial intelligence can main- 
tain and amplify inequity. Most algorithms 
exploit crude correlations in data. Yet these 
correlations are often by-products of more 
salient social relationships (in the health-care 
example, treatment that is inaccessible is, by 
definition, cheaper), or chance occurrences 
that will not replicate. 

To identify and mitigate discriminatory 
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relationships in data, we need models that 
capture or account for the causal pathways 
that give rise to them. Here we outline what 
is required to build models that would allow 
us to explore ethical issues underlying seem- 
ingly objective analyses. Only by unearthing 
the true causes of discrimination can we build 
algorithms that correct for these. 


Causal models 


Models that account for causal pathways 
have three advantages. These ‘causal models’ 
are: tailored to the data at hand; allow us to 
account for quantities that aren’t observed; 
and address shortcomings in current concepts 
of fairness (see ‘Fairness four ways’). 

A causal model? represents how data are 
generated, and howvariables might change in 
response to interventions. This can be shown 
as a graph in which each variable is a node 
and arrows represent the causal connections 
between them. Take, for example, a data set 
about who gets a visa to work in a country. 
There is information about the country each 
person comes from, the work they do, their 
religion and whether or not they obtained a 


© 2020 Springer Nature Limited. All rights reserved. 


visa (see ‘Three causal tests’, part 1). 

This model says that the country of origin 
directly influences a person’s religion and 
whether they obtaina visa; so, too, do religion 
and type of work. Having a causal model allows 
us to address questions related to ethics, such 
as does religion influence the visa process? 

But because many different causal models 
could have led to a particular observed data 
set, it is not generally possible to identify the 
right causal model from that data set alone’. 
For example, without any extra assump- 
tions, data generated from the causal graph 
described here could seem identical to those 
from a graph in which religion is no longer 
linked to visa granting. A modeller must there- 
fore also leverage experiments and expert 
knowledge, and probe assumptions. 

Experiments can help in identifying factors 
that affect fairness. For example, a modeller 
wishing to explore whether ethnicity would 
affect treatment recommendations made 
online by health-care professionals could 
create two patient profiles that differ only 
in some respect that relates to ethnicity. For 
instance, one profile could have aname com- 
monto Americans of Chinese descent, andthe 
other anamecommonto Americans of African 
descent. If the treatment recommendations 
are the same, then names can be ruled out as 
a source of bias, and the model can be stress- 
tested in another way. 

Few aspects of a deep, multifaceted concept 
can be tested as easily as changing a name. 
This means that experimental evidence can 
underestimate the effects of discrimination. 
Integration of expert knowledge, particularly 
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from the social sciences and including 
qualitative methods, can help to overcome 
such limitations. This knowledge can be used 
to, for example, inform the modeller of varia- 
bles that might be influential but unobserved 
(lighter circles in ‘Three causal tests’), or to 
determine where to put arrows. 

Assumptions about unobserved variables 
that might alter the predictions of a model 
need to be clearly stated. This is particularly 
important when experiments cannot be run 
or more detailed expert knowledge is not 
available. For example, if ‘health-care access’ 
is not observed in a model attempting to pre- 
dict ‘health need’, then it is crucial to identify 
any potential impacts it might have on ‘health 
costs’ as wellas howitis affected by ‘ethnicity’. 

This need for context and metadata makes 
causal models harder to build than non-causal 
ones. It can also make them a more powerful 
way to explore ethical questions. 


Three tests 


Causal models can test the fairness of 
predictive algorithms in three ways. 


Counterfactuals. A causal model allows us to 
ask and answer questions suchas ‘Had the past 
been different, would the present or future 
have changed?’ Inthe visa example (see ‘Three 
causal tests’, part 1), algorithmic biases could 
be smoked out by tweaking parts of the model 
to explore, for instance: ‘Had individual X been 
Christian, would this algorithm have granted 
them a visa?’ A researcher could then identify 
what pieces of information an algorithm could 
use to achieve counterfactual fairness*: the 
algorithm’s output would not change regard- 
less of the individual's religion. For example, if 
the algorithm used just work and not country 
of origin or religion, it would satisfy counter- 
factual fairness. 


Sensitivity. In many settings, unknowns alter 
knowns — data we can observe are influenced 
by data we cannot. Consider a causal model for 
atrial setting (see ‘Three causal tests’, part 2). 

This model shows howtwo independent sets 
of unobserved quantities, structural racism 
and jury racism, can unfairly lead to a guilty 
verdict. Although researchers often cannot 
precisely identify unobserved variables, they 
can reason about how sensitive a model is to 
them. For instance, they can explore howsen- 
sitive our estimate of the causal link between 
legal representation and guilty verdict is to 
different levels of jury racism. Simulations of 
the worst-case bias scenarios (that is, when 
jury racism is highest) can then be used to 
alter jury selection to minimize the bias. 


Impacts. Data-driven decisions can have 
long-term consequences and spillover effects. 
These effects might not be obvious, especially 
in the standard machine-learning paradigm 


THREE CAUSAL 
TESTS 


Algorithmic fairness can be 
examined in different ways. 
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of predicting one short-term outcome. But 
carefully designed causal models can help 
researchers to use ‘interventions’ to probe the 
ripple effects of decisions far into the future>°. 
For instance, the models can help regulatory 
agencies to understand how changing aschol- 
arship algorithm influences who is accepted 
into lawschool (see ‘Three causal tests’, part 3). 
In this example, a single parent might need a 
scholarship so that they can reduce the hours 
they need to spend at ajob, leaving them more 
time for study. That boosts their grades and 
therefore influences their chances of being 
admitted to law school. This complex chain 
can be explored using causal models. 


Five steps 

Causal models are powerful tools, but they 
must be used appropriately. They are only 
models, and will thus fail to capture impor- 
tant aspects of the real world. Here we offer 
some guidelines on using them wisely. 
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Collaborate across fields. Researchers in 
statistics and machine learning need to know 
more about the causes of unfairness in soci- 
ety. They should work closely with those in 
disciplines suchas law, social sciences and the 
humanities. This will help them toincorporate 
the context of the data used to train the algo- 
rithms. For example, scholars should meet at 
interdisciplinary workshops and conferences. 
One such is next year’s Association for Com- 
puting and Machinery (ACM) conference on 
Fairness, Accountability and Transparency to 
derive a set of causal models for setting bail 
price and for immigration decisions. 

A great example of such collaborations 
is one between information scientist Solon 
Barocas at Cornell University in Ithaca, New 
York, and attorney Andrew Selbst at the Data 
& Society Research Institute in New York City. 
They described how current law is unable to 
deal with algorithmic bias’. Partly in response 
to this work, machine-learning researchers 
have launched a large subfield, known as 
algorithmic fairness, that looks into ways of 
removing bias from data. And we and other 
researchers now use causal models to quantify 
discrimination due to data. 


Partner with stakeholders. Predictive algo- 
rithms should be developed with people they 
are likely to affect. Stakeholders are best placed 
to provide input on difficult ethical questions 
and historical context. One example is the 
work by statistician Kristian Lum at the Human 
Rights Data Analysis Group in San Francisco, 
California, which investigates criminal-justice 
algorithms’. Such algorithms decide whether 
to detain or release arrested individuals and 
how high to set their bail, yet they are known 
to be biased. Lum has invited people affected 
by such decisions to speak at academic confer- 
ences attended by people who research these 
algorithms. This has led to closer collabora- 
tion, including the tutorial ‘Understanding the 
context and consequences of pre-trial deten- 
tion’ presented at the 2018 ACM conference on 
Fairness, Accountability and Transparency in 
New York. So far, most stakeholder work has 
focused on criminal justice. Another setting 
that would benefit from it is mortgage lending. 
We propose that a rotating, interdisciplinary 
panel of stakeholders investigates the impacts 
of algorithmic decisions, for example as part 
of anew international regulatory institute. 


Make the workforce equitable. Women and 
people from minority groups are under-rep- 
resented in the fields of statistics and machine 
learning. This directly contributes to the crea- 
tion of unfair algorithms. For example, if facial 
detection software struggles to detect faces 
of black people’, it is likely the algorithm was 
trained largely on data representing white peo- 
ple. Initiatives such as Black in Al (go.nature. 
com/38pbcaa) or Women in Machine Learning 
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Fairness four ways 


A flurry of work has conceptualized 
fairness. Here are some of the most popular, 
and ways in which causal models offer 
alternatives. 


Fairness through unawareness”. This 
method works by removing any data that 
are considered prima facie to be unfair. For 
example, for an algorithm used by judges 
making parole decisions, fairness through 
unawareness could dictate that data on 
ethnic origin should be removed when 
training this algorithm, whereas data on the 
number of previous offences can be used. 
But most data are biased. For instance, 
number of previous offences can bear the 
stamp of historical racial bias in policing, 
as can the use of plea bargaining (pleading 
guilty being more likely to reduce a sentence 
than arguing innocence)”. This can leave 
researchers with a hard choice: either 
remove all data or keep biased data. 

Alternatively, causal models can directly 
quantify how data are biased. 


Demographic parity“. A predictive algorithm 
satisfies demographic parity if, on average, 

it gives the same predictions to different 
groups. For example, a university-admissions 
algorithm would satisfy demographic 

parity for gender if 50% of its offers went to 
women and 50% to men. It is currently more 
common in law to relax demographic parity 
so that predictions aren't necessarily equal, 
but are not too imbalanced. Specifically, 

the US Equal Employment Opportunity 
Commission states that fair employment 
should satisfy the 80% rule: the acceptance 
rate for any group should be no less than 
80% of that of the highest-accepted group. 
For instance, if 25% of women were offered 
jobs, and this is the highest acceptance rate, 
then at least 20% of men must be offered 


(go.nature.com/2s5km5g) are positive steps. 

And we can go further. Causal models 
can themselves help to address the field’s 
‘pipeline problem’ by identifying where 
unfairness enters the process and which 
interventions can increase the participation of 
under-represented groups without shifting the 
burden to extra work for role models in those 
groups. Academic institutions should critically 
evaluate and use these models for fairer admis- 
sions in fields related to artificial intelligence. 


Identify when algorithms are inappro- 


priate. Statistics and machine learning are 
not all-powerful. Some problems should 
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jobs*. One criticism of demographic parity 
is that it might not make sense to use it in 
certain settings, such as a fair arrest rate for 
violent crimes (men are significantly more 
likely to commit acts of violence)”. 

Instead, one could require that 
counterfactual versions of the same 
individual should get the same prediction’. 


Equality of opportunity”. This is the 
principle of giving the same beneficial 
predictions to individuals in each group. 
Consider a predictive algorithm that grants 
loans only to individuals who have paid back 
previous loans. It satisfies ‘disability-based 
equality of opportunity’ if it grants loans to 
the same percentage of individuals who both 
pay back and have a disability as it does to 
those who pay back and who do not have a 
disability. However, being able to pay back a 
loan in the first place can be affected by bias: 
discriminatory employers might be less likely 
to hire a person with a disability, which can 
make it harder for that person to pay back a 
loan. This societal unfairness is not captured 
by equality of opportunity. 

A causal model could be used to quantify 
the bias and estimate an unbiased version of 
loan repayment. 


Individual fairness”. This concept states 
that similar individuals should get similar 
predictions. If two people are alike except 
for their sexual orientation, say, an algorithm 
that displays job advertisements should 
display the same jobs to both. The main 
issue with this concept is how to define 
similar. In this example, training data will 
probably have been distorted by the fact that 
one in five individuals from sexual or gender 
minorities report discrimination against 
them in hiring, promotions and pay’®. Thus 
similarity is hard to define, which makes 
individual fairness hard to use in practice. 

In causal modelling, counterfactuals 
offer a natural way to define a similar 
individual. M.J.K. & J.R.L. 


not be solved by expanding data-gathering 
capabilities and automating decisions. For 
example, amore accurate model for predic- 
tive policing won’t solve many of the ethical 
concerns related to the criminal legal sys- 
tem. Infact, these methods can mask struc- 
tural issues, including the fact that many 
neighbourhoods are policed by people who 
do notlive in them”. This disconnect means 
that police officers might not be invested 
inthe community they police or the people 
they arrest. 

There are red flags when demographics, 
suchas ethnic origin, influence nearly every 
piece of information in a causal graph, or 
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when previous attempts to address a bias 
failed because people strategically changed 
behaviours in response. In these cases, an algo- 
rithmic solution would paper over a system 
that needs fundamental change. 


Foment criticism. A vibrant culture of 
feedback is essential. Researchers need to 
continually question their models, evalua- 
tion techniques and assumptions. Useful as 
causal models are, they should be scrutinized 
intensely: bad models can make discrimina- 
tion worse”. At the very least, a scientist should 
check whether a model has the right data to 
make causal claims, and how much these 
claims would change when the assumptions 
are relaxed. 


Algorithms are increasingly used to make 
potentially life-changing decisions about 
people. By using causal models to formalize 
our understanding of discrimination, we must 
build these algorithms to respect the ethical 
standards required of human decision makers. 
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Readers respond 


Correspondence 


Ongoing horror of 
2019 oil disaster 


The mystery crude-oil spill that 
struck Brazil from late August 
last year continues to severely 
affect thousands of kilometres 
of the country’s northeastern 
coastline. Remediation and 
containment measures are 
being hampered because the 
source and timing of the spill are 
still unclear. The consequences 
of this environmental and 
societal disaster could last for 
decades. 

Besides the spill’s 
catastrophic impact on the 
region’s marine biodiversity 
(more than 40 of Brazil’s Marine 
Protected Areas have been hit), 
it affects the livelihoods and 
food security of millions of 
coastal residents. In the region 
of Pernambuco, for example, 
sales of fish and shellfish have 
plummeted by around 80% 

(M. E. de Aratijo et al. Cad. Satide 
Publ. http://doi.org/dkq7; 2020). 

The decline in sales of fish 
and seafood in the region is 
exacerbated by public confusion 
over the authorities’ conflicting 
advice on safe eating habits. 
The biological accumulation of 
toxins in food animals is likely to 
pose along-term risk to human 
health. 


Richard J. Ladle* Institute of 
Biological and Health Sciences, 
Federal University of Alagoas, 
Maceio, Brazil. 

*On behalf of 4 correspondents; 
see go.nature.com/2tz2u13 
richardjamesladle@gmail.com 


Don’t cheat Chinese 
environment laws 


Some local authorities in China 
are resorting to ‘quick fixes’ to 
comply with strict regulations 
imposed by the 2014 revised 
Environmental Protection Law. 
Such tactics must be stopped: 
they mask pollution issues 
that could defeat long-term 
environmental goals. 

For example, to pre-empt 
scrutiny and quickly improve 
air-quality rankings in Lanshan, 
Shandong province, 300 or 
so restaurants were shut and 
production at more than 
400 wooden-fibreboard 
factories was stopped. Many of 
these businesses had pollution- 
control measures in place (www. 
mee.gov.cn). In another case, 
farmers in Shangcai, Henan, 
were told to harvest 5 hectares 
of wheat by hand to avoid dust 
from mechanized harvesting 
affecting readings at anearby 
air-monitoring site (J. Qu 
Chutian Metropolis Daily 10 June 
2019). Air-quality monitoring 
data have also been manipulated 
in Xi’an, Shaanxi (D. Liu and 
S. Wang /nt. J. Environ. Sci. 
Technol. 16, 4963-4966; 2019). 

Local leaders’ success should 
be measured by their progress 
on long-term environmental 
improvements. Regional 
governments should therefore 
be allowed a reasonable period 
to address environmental 
issues. The focus should be on 
tackling underlying causes of 
environmental problems, with 
technical backing from central 
government, rather than on 
misguided quick fixes. 


Dasheng Liu Ecological Society of 
Shandong, Jinan, China. 
ecologyliu@163.com 


Renqing Wang Shandong 
University, Qingdao, China. 


Julian R. Thompson University 
College London, UK. 


Ditch group metrics 
for student hopefuls 


As an associate professor at 
an Australian university who 
was educated at unranked 
universities in India, I find 
it disturbing that some 
universities are now using 
international university 
rankings to help assess graduate 
students for admission. In 
my view, this risks promoting 
and institutionalizing 
discrimination, and hence 
undermines global efforts to 
increase diversity in academia. 
When | applied in 1998 to 
doaPhD at the University 
of Zurich, Switzerland, the 
university requested my 
degree-course syllabuses 
from India. My opportunities 
were not scuttled by the 
ranking of those universities. 
Sol was shocked when one of 
my students showed me the 
applications section for master’s 
programmes at several premier 
institutions . These required 
applicants to give the ranking 
of the university where they 
studied as an undergraduate, for 
use as an assessment parameter. 
Such ‘objective’ metrics 
could be viewed as a way to 
reduce the selection workload 
and avoid unconscious biases. 
But individuals should not 
be assessed through a group- 
based metric that reinforces 
stereotypes. And, given 
that university rankings are 
correlated with per capita gross 
domestic product (E. F. Tuesta 
etal. J. Data Inf. Sci. 4, 56-78; 
2019), organizations also risk 
making the serious mistake of 
equating an applicant’s ability 
with regional and economic 
differences. 


Sureshkumar Balasubramanian 
Monash University, Melbourne, 
Australia. 
mb.suresh@monash.edu 


No foundation for 
anti-nuclear bias 


In his otherwise excellent review 
of Thane Gustafson’s book 

The Bridge, Andrew Moravcsik 
includes nuclear power in his 
list of energy sources to which 
natural gas is “environmentally 
superior” (Nature 576, 30-31; 
2019). Burning natural gas 

in fact releases almost half 

as much carbon dioxide into 
the atmosphere as does coal. 
Nuclear power production 
itself releases none. Taking 

into account the CO, released 
from fossil fuels burnt during 
plant construction and uranium 
mining and processing, nuclear 
energy ranks about equally with 
solar power — so still much less 
polluting than natural gas. 

If Moravesik is referring to 
damage from environmental 
releases, nuclear power has 
proved itself to be much cleaner 
and safer than natural gas. If he 
is considering nuclear waste, 
existing practice effectively 
sequesters spent nuclear fuel 
from the environment by using 
dry-cask storage. Permanent 
disposal sites (two currently: 
the Waste Isolation Pilot Plant 
in southern New Mexico, for 
military waste, and Onkalo, 
under construction in Finland) 
will effectively isolate nuclear 
waste for centuries. 

Anti-nuclear bias has no 
place ina pre-eminent journal 
of science, especially when 
global heating is increasing 
dangerously as a result of our 
over-dependence on fossil fuels. 


Richard Rhodes, Half Moon Bay, 
California, USA. 
richardrhodes1@comcast.net 
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Expert insight into current research 


News & views 


Tumour genetics 


Global cancer genomics 
project comes to fruition 


Marcin Cieslik & Arul M. Chinnaiyan 


A massive international effort has yielded multifaceted 
studies of more than 2,600 tumours from 38 tissues, 
generating a wealth of insights into the genetic basis of cancer. 
See p.82, p.94, p.102, p.112, p.122 & p.129 


Comprehensive genomic characterization 
of tumours became a major goal of cancer 
researchers as soon as the first human genome 
had been sequenced in 2001. Since then, 
advances in sequencing technology and ana- 
lytical tools have allowed this research field to 
flourish. In six papers’ ° in this issue of Nature, 
the Pan-Cancer Analysis of Whole Genomes 
(PCAWG) Consortium presents the most com- 
prehensive and ambitious meta-analysis of 
cancer genomes so far. Unlike previous efforts 
that focused largely on protein-coding regions 
of the cancer genome, PCAWG analyses whole 
genomes. Each article scrutinizes an impor- 
tant aspect of cancer genetics — together, their 
findings will be key to understanding the full 
genetic complexity of cancer. 

Before discussing the impact of these 
analyses, it is crucial to highlight the massive 
amount of data and the complex organiza- 
tional framework that underpin the PCAWG 
endeavour. The project involved an interdisci- 
plinary group of scientists from 4 continents, 
with 744 affiliations between them, who had 
to overcome major technical, legal and ethical 
challenges to carry out distributed analyses 
while protecting patient data. Researchers 
were divided into 16 working groups, each 
focused on distinct facets of cancer genom- 
ics — assessing the recurrence of mutations, 
for instance, or inferring tumour evolution. 

Altogether, the consortium performed 
integrative analyses of 38 tumour types. 
The group sequenced 2,658 whole-cancer 
genomes (Fig. 1), alongside matched sam- 
ples of non-cancerous cells from the same 
individuals. These data were complemented 
by 1,188 transcriptomes — the sequences and 
abundances of RNA transcripts in a tumour. 

These efforts involved extensive quality 
control and coordinated data processing, 


as well as massive, systematic experimental 
validation of the computational pipelines used 
to detect mutations. Many computational 
algorithms and pipelines were used and com- 
pared in concert. This required hundreds of 
terabytes of data, spread across multiple data 
centres, and probably millions of processing 
hours — all facilitated by cloud computing. 
Notably, the PCAWG efforts provide a prime 
example of how cloud computing can make 
international collaboration possible and help 
to advance data-intensive fields. 

The first of the current papers! (page 82) 
gives an overview of the breadth and depth of 


the PCAWG data set. The consortium reports 
that, on average, each cancer genome carries 
four or five driver mutations, which provide 
cancer cells with a selective advantage. Only 
5% of tumours studied had no identified 
driver aberrations. By contrast, many cancers 
exhibited hallmarks of genomic catastrophes 
called chromoplexy (17.8% of tumours) and 
chromothripsis (22.3%), which result in major 
structural changes to the genome. 

The other five papers each delve into a 
different aspect of the data set in more detail. 
For instance, in the second paper, Rheinbay 
et al.” (page 102) set out to identify genetic 
drivers in non-coding DNA. This is an ambi- 
tious undertaking, because it is substantially 
more difficult to accurately detect mutations 
innon-coding regions than in coding regions, 
or to assess their recurrence. The authors 
used careful modelling to exclude artefacts 
and systematically identify non-coding driver 
mutations. 

Their results call into question previously 
reported non-coding drivers, suchas the long 
non-coding RNAs NEAT1 and MALAT1, but also 
reveal new ones. For example, the authors 
report arecurrent mutation in a non-coding 
region of the key tumour-suppressor 
gene 7P53. They also found relatively fre- 
quent mutations in non-coding regions of 
the telomerase gene 7ERT that result in over- 
expression of the telomerase enzyme (which 
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Figure 1| A worldwide effort to tackle cancer. The Pan-Cancer Analysis of Whole Genomes (PCAWG) 
Consortium is a group of cancer researchers from four continents (blue). The group sequenced and 
analysed 2,658 whole-cancer genomes from 38 types of tumour. The huge amount of data involved in the 
effort required sophisticated cloud-computing approaches. Six papers’ ° from the PCAWG now describe 
different aspects of the analyses performed. (Nature publications remain neutral with regard to contested 


jurisdictional claims in published maps.) 
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helps tumour cells to divide uncontrollably), 
mirroring the high (12%) prevalence of 
telomerase mutations found in a previous 
pan-cancer study of more-advanced (meta- 
static) tumours’. Although the study could 
notrule out the existence of other non-coding 
drivers, it decisively shows that this type of 
mutation is not common. 

In the third and fourth papers, Alexandrov 
et al.’ (page 94) and Liet al.* (page 112) focus 
on genomic aberrations called signatures. Dif- 
ferent processes, suchas defective DNA-repair 
mechanisms or exposure to environmental 
mutagens, produce these characteristic pat- 
terns of DNA aberrations. Large genomic 
data sets are crucial if we are to refine known 
mutational signatures and discover new ones. 
Impressively, between them, Alexandrovetal. 
and Lietal. identify 97 signatures. This expan- 
sion on previous work encompasses not only 
conventional single-nucleotide signatures, but 
also signatures involving multi-nucleotide var- 
iants and small insertions or deletions of DNA. 

Notably, Li and colleagues are among the 
first to uncover reproducible signatures involv- 
ing structural variants (SVs) — rearrangements 
of large portions of the genome. The process 
was much more intricate than that for identify- 
ing mutational signatures because of the 
diversity and complexity of SVs. 

Throughaseries of mutation-subgrouping 
steps, the researchers identified 16 SV sig- 
natures, revealing, for example, a putative 
mechanistic link between two SVs, deletions 
and reciprocal inversions (the last of which 
involves a reversal of the orientation of aseg- 
ment of DNA). They also gained insights into 
the roles of all 16 signatures in cancer. Muta- 
tions in certain DNA-repair genes were shown 
to associate with characteristic cancer signa- 
tures. For instance, the consortium found that 
mutations in the gene CDK12 associate with 
tandem stretches of duplicated DNA, and that 
truncated variants of the DNA-repair enzyme 
MBD4 co-occur with a distinct mutational 
signature involving DNA sequences called 
CpG sites. Altogether, these new signatures 
lay the foundation for understanding mecha- 
nisms of cancer development, and the role of 
mutagenic exposures in this process. 

The idea that cancer develops through an 
evolutionary process was first presented in 
1976 (ref. 8). Since then, cancer evolution 
has been characterized in terms of random 
mutations and natural selection. A cancer cell 
harbouring a mutation that confers high fit- 
ness proliferates rapidly, becoming the most 
prominent cell clone in the population. This 
phenomenon, called a clonal sweep, occurs 
in recurring cycles during cancer growth. 
Cancer evolution is most effectively studied 
by sequencing multiple regions of atumour 
over time, but it can also be reconstructed 
froma single biopsy — the approach taken by 
Gerstung etal.’ (page 122) in the fifth paper. 
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The authors introduce the concept of 
‘molecular time’ to classify clonal and sub- 
clonal mutations. They reasoned that subclonal 
mutations, which are present in only a subset 
ofatumour’s cells, must have arisen late inthe 
cancer’s evolution. They classify clonal muta- 
tions, which are present in all of a tumour’s 
cells, as early or late, depending on whether 
the mutations arose before or after the clone 
underwent copy-number gains (anincreasein 
the number of copies ofagene or chromosomal 
region). The researchers aggregated evolu- 
tionary data from multiple tumours, allowing 
them to identify common mutational trajec- 
tories such as APC-KRAS- TP53 progression’, 
which describes the typical sequence in which 
mutations arise in colorectal cancer. 

Gerstung et al. found that the driver 
mutationsthat most commonly occurinagiven 
cancer also tend to occur the earliest. Similarly, 
if copy-number gains are highly recurrent ina 
particular cancer type, they tend to occur early. 
For example, a copy-number gain in part of 
chromosome 5is common in clear cell kidney 
cancer, and tends to arise early in the disease’s 
evolution. Conversely, whole-genome dupli- 
cation is a relatively late event in this cancer. 
Finally, the researchers found that mutational 


“The broad availability and 
quality of the new dataset 
will almost certainly spura 
wave of biological insights.” 


signatures change over time in at least 40% of 
tumours. These changes reflect a decreasing 
role for environmental exposures in disease 
progression and an increase in the frequency 
and severity of DNA-repair defects. Overall, the 
group’s findings suggest that driver mutations 
can occur years before cancer is diagnosed, 
which has implications for early detection and 
biomarker development. 

In the final paper (page 129), the PCAWG 
Transcriptome Core Group and their col- 
leagues® made use of the 1,188 PCAWG samples 
that had matched transcriptome data, to func- 
tionally link DNA and RNA alterations. The 
group found associations between hundreds 
of single-nucleotide DNA mutations and the 
expression of nearby genes. However, larger 
copy-number alterations were the main driv- 
ers of gene-expression changes in cancer cells. 
Mutations were also associated with changes 
intranscript structure, suchas the formation 
of a new protein-coding region (an exon) 
within a non-coding region (an intron). 

The authors also characterized the 
frequency of bridged fusions — a phenomenon 
in which two genes become fused owing toa 
third, intervening fragment of DNA. Finally, 
although 87 of the 1,188 samples analysed did 
not have a driver alteration at the DNA level, the 
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group showed that every one of these had an 
RNA-level alteration. Together, these insights 
illustrate the power of integrated RNA- and 
DNA-sequencing analysis for cancer studies”. 

These six papers, together with compan- 
ion papers being co-published elsewhere 
(see go.nature.com/3boajsm), represent a 
milestone in cancer and cloud genomics. 
By focusing on inferences, the PCAWG suc- 
cessfully expands on a decade of cancer 
sequencing studies that were rooted largely 
in observations. It is worth noting that, 
although inferential analyses offer a deeper 
look at cancer than do descriptive studies, 
their results are also associated with a higher 
degree of uncertainty. 

The broad availability and quality of the 
PCAWG data set will almost certainly spur a 
wave of biological insights and methodol- 
ogical developments. Integration with other 
functional genomic data sets, for example 
probing the 3D organization of the genome, 
will also undoubtedly provide further under- 
standing of the causes and consequences of 
genetic aberrations. 

The biggest limitation of the current studies 
is the lack of clinical data concerning patient 
outcomes and treatments. Such data would 
allow researchers to identify the genetic 
changes that can predict clinical outcomes. 
Fortunately, a project called the International 
Cancer Genome Consortium-Accelerate 
Research in Genomic Oncology (ICGC-ARGO) 
is under way that will create such a resource 
for more than 100,000 people with cancer. 

Ultimately, the PCAWG brought together 
thousands of scientists, working together to 
achieve its aims. The long-term impact of these 
efforts will not be limited to the findings 
published today, but will also come from 
the collaborations that have formed and the 
knowledge exchanges that have taken place 
between members of this global consortium 
of researchers. 
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A platform for making and 
transferring oxide films 


Atsushi Tsukazaki 


Crystalline films of technologically useful oxide materials 
have been grown by a method based on surface-modified 
substrates. Unlike usual oxide films, these can be easily 


transferred to any material. See p.75 


Inorganic compounds that contain oxygen 
and at least two other elements are known 
as complex oxides. Crystalline films of these 
compounds have desirable properties such 
as superconductivity, magnetism and ferro- 
electricity (spontaneous electric polariza- 
tion), and could be used in next-generation 
devices’? ifthey can be integrated with mature 
device technologies. Integration is typically 
achieved by growing films on compatible 
substrates using a method called epitaxy, 
but this approach works only for relatively 
limited material systems. In the past few years, 
free-standing membranes of certain oxides 
have been made*» by removing epitaxial films 
from substrates using a process dubbed chem- 
ical lift-off. Now, on page 75, Kumet al.° report 
a versatile method for producing a wide vari- 
ety of complex-oxide films that can be easily 
transferred to any material. 

The authors’ approach uses a technique 
known as remote epitaxy’ °, in which the epi- 
taxial film and the substrate are separated by 


Cobalt 
ferrite film —— 
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Figure 1| Growth and integration of complex-oxide films. a, Kum et al.° 
report a versatile approach for making high-quality films of technologically 
useful compounds called complex oxides and transferring these films to 
other materials. As a demonstration, the authors grew a film of the complex 
oxide cobalt ferrite using a technique known as remote epitaxy, whereby the 
film and the underlying substrate are separated by a few sheets of the material 
graphene. b, They then exfoliated (removed) the film from the graphene 


a few sheets of the two-dimensional material 
graphene (Fig. la). Potential-energy fields pro- 
duced by atoms in the substrate can penetrate 
the graphene and transmit information about 
the substrate’s crystal lattice, enabling the epi- 
taxial growth of high-quality films. The field 
penetrability is proportional to the strength 
of ionic bonds in the substrate material®. A 
film grown in this way can be easily removed 
(exfoliated) from the graphene because the 
two materials are coupled by only weak van der 
Waals forces (Fig. 1b). Remote epitaxy there- 
fore combines outstanding epitaxial growth 
and exfoliation. 

The fabrication of high-quality oxide films 
requires a well-regulated growth scheme and 
atomic-level control over the material inter- 
faces and substrate surfaces’ *. In the past few 
decades, single-crystal oxide substrates of 
various crystal structures have become com- 
mercially available. These include substrates 
of strontium titanate, aluminium oxide and 
magnesium aluminate, which have perovskite, 
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structures. 
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corundum and spinel crystal structures, 
respectively. 

For the epitaxial growth of a particular 
film, the substrate should be appropriately 
selected in terms ofits crystal structure, lattice 
dimensions and coefficient of thermal expan- 
sion — a quantity that describes how the size 
of a material is affected by a change in temp- 
erature. Consequently, growth conditions, 
such as temperature, oxygen pressure and 
growth rate, need to be optimized to stabilize 
the desired crystalline phase and obtain high 
crystallinity. 

In their remote-epitaxy work, Kum and 
colleagues carefully optimized the growth 
conditions of oxide films on graphene-coated 
substrates. In general, control over the degree 
of oxidation is crucial for making high-quality 
oxide films, so the background oxygen pres- 
sure should be well regulated during film 
growth. However, when the authors used a 
growth method called pulsed-laser deposi- 
tionand supplied oxygen to their set-up at the 
required high temperature, they found that 
the graphene was etched from the substrate. 
To prevent this etching, they grew the initial 
part of the oxide film (a thickness of about 
5-10 nanometres, compared with a final thick- 
ness of the order of 100 nm) ina vacuum. The 
crystallinity of this part was still high owing to 
oxidation of the film during the growth of the 
remaining part. Finally, the authors exfoliated 
the oxide film from the graphene to produce 
a free-standing oxide membrane. 

In other experiments, Kum etal. found that 
strontium ruthenate could be used instead of 
graphene to grow an oxide film by a process 
known as sputtering. The film could then 
be exfoliated from the strontium ruthenate 
by depositing a layer of nickel on top of the 
film. The nickel acts as a stressor — it provides 
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to produce a free-standing membrane. c, Finally, the authors integrated 

the membrane with an electrode and a membrane of another complex 

oxide, lead magnesium niobate-lead titanate (PMN-PT), which had also been 
made using remote epitaxy (but replacing graphene with strontium ruthenate). 
Such integration is difficult to achieve using the conventional scheme for 
growing oxide films because cobalt ferrite and PMN-PT have different crystal 
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enough strain energy to overcome the weak 
bonds between the film and the strontium 
ruthenate. 

Kum and co-workers demonstrated the 
transferral of oxide membranes to other 
materials for: strontium titanate, yttrium iron 
garnet and magnetic cobalt ferrite, produced 
by pulsed-laser deposition; lead magnesium 
niobate-lead titanate (PMN-PT), formed 
by sputtering; and ferroelectric barium 
titanate, made by a process called molecu- 
lar-beam epitaxy. One example of a stacked 
structure produced by such transferral con- 
sists of a300-nm-thick layer of cobalt ferrite, 
an electrode and a 500-nm-thick layer of 
PMN-PT (Fig. 1c). 

The authors found that this structure 
displays high magnetostriction (coupling 
between magnetic and mechanical behav- 
iour) and piezoelectricity (coupling between 
electric and mechanical behaviour), because 
it is free-standing rather than being clamped 
by a substrate. Cobalt ferrite, PMN-PT and 
yttrium iron garnet have different crystal 
structures, making it difficult to stack these 
materials by the usual growth scheme without 
such clamping. 

Kum etal. also stacked graphene and oxide 
membranes to examine the electrical cou- 
pling between these materials. The density 
of electric charge in graphene can be inferred 
from the positions of peaks in Raman spectra 
— spectra generated through the scattering of 
incident light. The authors found that these 
positions depend on the stacked structure, 
indicating that charge is transferred across 
graphene-membrane interfaces. These results 
suggest that stacks of other combinations of 
materials will offer ways to integrate the var- 
ious functions of oxides with mature device 
technologies. 

The authors’ exfoliation technique enables 
complex-oxide films to be easily transferred 
from an epitaxial interface to any material. 
Because the thickness and stacking of films 
can be controlled, ultrathin membranes and 
stacks of various membranes could be pos- 
sible. Such a simple way of transferring the 
functions of oxides might advance the field of 
oxide-based electronics through integration 
with emerging quantum material systems”. 
However, the availability of graphene-coated 
substrates could bea key issue for developing 
the method. 

This technique will probably be extended 
beyond the transferral of complex-oxide 
films. For example, it might provide an inno- 
vative strategy for engineering interfaces, 
by allowing 2D or 3D films and membranes 
to be integrated with each other through 
effects associated with multiple couplings 
between them. An understanding of the 
chemical or physical bonds at the interface 
between membranes in stacked structures is 
crucial and will reveal how such an interface 
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differs from the epitaxial one. Finally, unusual 
material combinations (in which, for example, 
the size and orientation of crystal lattices of 
membranes are mismatched) could enable 
useful interface functions that are difficult to 
achieve or control at the epitaxial interface. 
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Latent HIV-1 gets ashock 


Mathias Lichterfeld 


HIV-1 can evade the immune system by hiding out ina dormant 
form. Two studies describe interventions that can effectively 
reactivate the latent virus in animals, potentially rendering it 
vulnerable to immune-mediated death. See p.154 & p.160 


‘Shock and kill’ might sound like a military 
strategy, but in fact it describes the dominant 
model currently used in the search for a cure 
for HIV-1infection. Although antiretroviral 
therapy (ART) is highly effective at limiting 
the extent of the infection, the virus can hide 
out ina ‘latent’ form in immune cells called 
CD4* T cells, undergoing little or no tran- 
scription and thus remaining undetected by 
the immune system!”. When ART is stopped, 
these viral-reservoir cells can rapidly fuel HIV 
rebound. The theory behind ‘shock and kill’ 


“Thecurrent studies 
showcase some ofthe 
conceptual and technical 
challenges intrinsically 
associated with 
pharmacological latency 
reversal.” 


involves the use of drugs that reverse this 
latency and could increase viral gene expres- 
sion (shock), rendering the viral-reservoir cells 
vulnerable to elimination (kill) by other 
cells of the immune system. Two groups** 
now describe distinct interventions in animal 
models that cause what seem to be the most 
robust and reproducible disruptions of viral 
latency reported so far. 

In the first study, Nixon et al.? (page 160) 
focus ona drug called AZD5582, which can 
activate the transcription factor NF-KB —a 
major instigator of HIV-1-gene expression. 
AZD5582 was originally developed to treat 
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cancer, and activates the ‘non-canonical’ 
NF-kB pathway, which results in an atypical 
type of NF-KB-driven transcription that is slow 
but persistent. The authors tested AZD5582in 
two animal models: ‘humanized’ mice (which 
carry human-derived liver, bone-marrow 
and thymus cells) that were infected with 
HIV; and rhesus macaques infected with 
the HIV-related simian immunodeficiency 
virus (SIV). Both groups of animals were already 
receiving ART. 

The authors demonstrated that AZD5582 
treatment led to marked increases in the 
levels of viral RNA in CD4* T cells ina range 
of tissues in both species, indicating that 
transcription of the virus had been activated. 
This was combined with a substantial rise 
in virus levels in the blood. AZD5582 is not 
optimized for use in humans; nonetheless, 
these results suggest that pharmacological 
activation of the non-canonical NF-KB path- 
way could be an attractive way to trigger 
HIV-1-gene expression as part of a shock-and- 
kill approach (Fig. 1). 

In the second study, McBrien et al.* 
(page 154) used an entirely different, though 
complementary, approach to disrupting 
viral latency. Again, the authors used both 
ART-treated humanized mice infected with 
HIV-1 and ART-treated, SIV-infected rhesus 
macaques. They combined two immuno- 
logical interventions. The first involves anti- 
body-mediated depletion of CD8* T cells 
— immune cells previously shown to act in 
concert with ART to reduce levels of viral tran- 
scription’. The second, administered concur- 
rently, involves treatment with a drug called 
N-803, which strongly activates the signalling 


molecule interleukin-15 (IL-15), and which has 
been previously shown’ to activate HIV-1 tran- 
scription in vitro. Like Nixon and colleagues, 
the researchers found that their treatment 
caused substantial increases in virus levels in 
the blood, and in viral RNA in cells from vari- 
ous tissues. 

At first glance, the combined interventions 
used by McBrien and colleagues might seem 
contradictory, because IL-15 is one of the 
strongest activators of CD8* T cells”*. But the 
synergistic effects of these two interventions 
raise the provocative possibility that the best 
strategies for targeting viral-reservoir cells 
involve a mix of immune interventions — sup- 
pressing immune components that seem to 
have a role in stabilizing viral latency (such 
as CD8* T cells) while activating others that 
can effectively disrupt latency (such as IL-15 
signalling). 

How exactly CD8* T-cell depletion 
interacts with IL-15 to reverse HIV-1 latency is 
unknown. Given the vast array of direct and 
indirect effects resulting from depletion of 
CD8* T cells’, it will not be easy to define the 
precise molecular mechanisms underlying this 
synergy. But an understanding of this relation- 
ship might reveal downstream proteins that 
are jointly targeted by these interventions 
and that could therefore be used to optimize 
latency reversal in the clinic. 

In addition to the advances they make, 
the current studies showcase some of the 
conceptual and technical challenges intrin- 
sically associated with pharmacological 
latency reversal. First, the latency-reversing 
agents (LRAs) evaluated (as well as all other 
LRAs described so far’°) target factors that 
have crucial roles in modulating host-cell 
gene transcription, in addition to viral tran- 
scription. Their use therefore comes with an 
intrinsic risk of toxic off-target effects. The 
toxicity of the LRAs described by McBrien 
et al. and Nixon et al. seems to be acceptable 
in animal models, with most showing no clin- 
ical side effects. However, much more strin- 
gent safety standards must be met in human 
clinical trials. 

Mechanisms of viral latency might vary 
between individual viral-reservoir cells and 
are likely to be influenced by the position at 
which the HIV-1 genomes have integrated into 
the host-cell chromosomes”. It is therefore 
possible that only subsets of cells will respond 
to individual LRAs, which typically target 
one specific mechanism of viral latency. The 
actual proportion of viral-reservoir cells that 
responded to the interventions in the two cur- 
rent studiesis uncertain, and would be difficult 
to determine experimentally”. 

Another uncertainty is how much of the 
increase in HIV-1 RNA is attributable to 
CD4° T cells carrying HIV-1 that can replicate 
effectively? This is of interest because most 
viral-reservoir cells harbour HIV-1 genomes 
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Figure 1| Two approaches to reactivating dormant HIV-1. HIV-1 can integrate into the genome of 

CD4*T cells ina latent form — it is not transcribed into messenger RNA and so is not detected by the body’s 
immune system. Two papers describe ‘shock’ treatments that can reactivate transcription of latent HIV 

in mice and the related virus SIV in monkeys. Nixon et al.’ used a drug called AZD5582 to activate the 
non-canonical NF-kB signalling pathway, which stimulates virus transcription. McBrien et al.* used two 
interventions — a drug called N-803 to stimulate the protein IL-15, which promotes transcription, and an 
antibody treatment that depletes immune cells called CD8" T cells, which seem to have a role in dampening 
HIV transcription. After these shock treatments have reactivated the virus, interventions that target and kill 
the virus-carrying CD4* T cells should help to eliminate the latent viral reservoir. Such treatments remain to 


be designed. 


that contain lethal sequence defects, probably 
as aresult of errors introduced during reverse 
transcription of viral RNA, which produces 
the viral DNA that is integrated into the host 
genome. These defective viral genomes can 
often still be transcribed and respond toLRAs, 
but they cannot cause viral rebound when ART 
is stopped and so do not represent the main 
target for shock-and-kill interventions. In 
addition, it is unclear how disrupting latency 
might influence the evolutionary dynamics 
of the reservoir cells — whether, for instance, 
a shock treatment kills some subsets of 
CD4* T cells that are highly susceptible to 
latency disruption, but confers a selective 
advantage on other subsets of non-susceptible, 
difficult-to-reactivate cells. 

Most importantly, neither of the inter- 
ventions tested in the current studies led 
to a change in the expression of markers 
of viral-reservoir size. A decrease in these 
markers is the most informative and cru- 
cial endpoint parameter for shock-and-kill 
approaches. The absence of an effect on 
viral-reservoir size probably reflects the fact 
that the studies were mainly designed to inves- 
tigate latency reversal, and lacked dedicated 
‘kill’ interventions. Combining ‘shock’ inter- 
ventions with ‘kill’ components is a key next 
step. In fact, that they provide a suitable model 
for evaluating ‘kill’ strategies in the setting of 
robust and efficient latency reversal might be 
one of the strengths of the current studies. 

Finally, the work of Nixon and colleagues 
and McBrien and colleagues should not 
distract from the fact that the shock-and-kill 
strategy currently remains largely a theoretical 
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concept, nota therapeutic reality. Establishing 
evidence for its ability to reduce viral reser- 
voirs and to deliver real benefits to patients 
will require much more work. 
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Accelerator physics 


Muon colliders 


comeastep closer 


Robert D. Ryne 


Particle colliders that use elementary particles called muons 
could outperform conventional colliders, while requiring 
much smaller facilities. Muon cooling, a milestone on the road 
to these muon colliders, has now been achieved. See p.53 


“SMASH! Colossal colliders are unlocking the 
secrets of the universe.” The cover story of 
the 16 April 1990 issue of Time magazine dis- 
cussed giant particle accelerators, including 
the Superconducting Super Collider in Texas, 
which was ultimately judged to be too expen- 
sive for completion. Researchers at CERN, 
Europe’s particle-physics laboratory near 
Geneva, Switzerland, went on to construct 
the Large Hadron Collider (LHC) in an existing 
tunnel. The LHC and other accelerators have 
been responsible for many major discoveries, 
but these “colossal colliders” have become 
colossally costly. Innovative approaches will 
thus be required to reduce the expense of 
future colliders in the search for previously 
unseen particles and physics phenomena. On 
page 53, the Muon Ionization Cooling Experi- 
ment (MICE) collaboration! reports results that 
bring scientists a step closer to realizing one of 
these innovative approaches: a muon collider. 

Muons, like electrons, are elementary par- 
ticles in the standard model of particle phys- 
ics, but they have about 200 times the mass of 


Target 


Secondary 
particle 


electrons (go.nature.com/3twyjba). This fact 
has ramifications for the size, and therefore 
cost, of colliders, and for the energy that can 
be reached in their particle collisions (and thus 
their potential for discovery). 

Although the goal is to accelerate particles 
so that they collide at the highest possible ener- 
gies, the particles actually lose energy through 
radiation when their trajectories are bent by 
accelerator magnets. Heavy particles such as 
protons and muons lose much less energy than 
do lightweight particles such as electrons. For 
this reason, the circular colliders that can reach 
the highest energies (for example, the LHC) 
use protons. However, protons are not elemen- 
tary particles. They are made up of elementary 
particles called quarks, and because the colli- 
sions are between bound quarks, only about 
one-sixth to one-tenth of the energy from 
proton collisions is available to produce other 
particles”. By contrast, because muons are ele- 
mentary particles, all of the energy from their 
collisions is available for particle production. 

Muon accelerators would have uses beyond 
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Figure 1 | Production and ionization cooling of muons. The MICE collaboration’ carried out an experiment 
in which a beam of protons was directed at a target to generate secondary particles. Some of these particles 
decayed into elementary particles known as muons. The positions and velocities of the muons in the 
resulting beam had a wide spread (indicated by the dashed lines) in the directions perpendicular to the 
beam. Finally, the muons passed through an energy-absorbing medium made of lithium hydride or liquid 
hydrogen that reduced this spread by a process called ionization cooling. The process demonstrated by the 
authors could someday lead to a muon-based particle accelerator. 
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those for particle colliders. For example, a 
‘Higgs factory’ is a highly desirable facility that 
would produce huge numbers of elementary 
particles known as Higgs bosons and allow 
the properties of these particles to be pre- 
cisely determined. A Higgs factory based on 
aconventional linear accelerator that collides 
electrons and positrons (the antiparticles of 
electrons) would have to be 10-20 kilometres 
long®. But one based ona circular muon col- 
lider would require a circumference of only 
about 0.3 km (ref. 4). In another example, if 
muons could be stored in a racetrack config- 
uration that has long, straight sections, the 
decay of the muons in these sections would 
produce intense neutrino beams. Sucha facil- 
ity, called aneutrino factory, would shed light 
on the mysteries of neutrinos and on physics 
beyond the standard model. 

Before aneutrino factory or amuon collider 
can exist, scientists must learn howto manip- 
ulate muon beams. Unlike electron beams, 
which are produced with almost laser-like 
quality, muon beams are generated through 
acomplicated process resulting in a beam that 
ismore reminiscent of the spray of pellets from 
a shotgun. This spray needs to be converted 
into a laser-like beam. 

Such a conversion involves reducing the 
spread of the muons’ positions and velocities 
in the directions perpendicular to the beam. 
A temperature can be associated with this 
spread, and cooling the beam decreases the 
spread. Several cooling techniques are used at 
accelerators, but none is fast enough to cool 
muons, which are unstable and short-lived. 

Instead, a method called ionization cool- 
ing has been proposed for cooling muon 
beams?*, although it has never been used. In 
this approach, muons travel through an accel- 
erator, a portion of which contains a material 
of low atomic mass, and the spread of the 
muons’ positions and velocities is reduced 
as the particles ionize atomic electrons inthe 
material. The MICE collaboration’s aim was to 
build and test asystem for the ionization cool- 
ing of muons, to demonstrate this cooling for 
the first time and to validate simulation tools 
for the design of ionization-cooling systems. 

Inthe authors’ experiment, a proton beam 
from the ISIS accelerator at the Rutherford 
Appleton Laboratory near Didcot, UK, struck 
atarget to produce secondary particles (Fig. 1). 
Some of these particles decayed into muons, 
which were directed into an experimental 
apparatus consisting of focusing magnets, 
beam instrumentation and a cooling section 
that contained an energy-absorbing medium 
made of lithium hydride or liquid hydrogen. 

Accelerator experiments usually measure 
the basic properties of abeam, suchas its cen- 
tre of mass, its spread in particle positions or 
its density profile. To demonstrate ionization 
cooling, the MICE collaboration took the 
unprecedented step of using the technology 
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of collider detectors to measure both the 
input and output coordinates and velocities 
of every individual muon that passed through 
the experimental apparatus. As a result, the 
authors could unequivocally demonstrate that 
they had achieved ionization cooling of muons. 

Organizations worldwide are developing 
long-term strategies for exploring the 
high-energy frontier. Plans include designs for 
circular colliders upto 100 kmin circumference 
andlinear colliders up to 50 kmlong’. Although 
these approaches, which would use protons or 
electronsand positrons, have the least technical 
risk, they still have a substantial cost, as well as 
technical challenges, that affect their feasibility. 

Other plans include designs that would use 
innovative technologies such as those based 
on lasers and plasmas’. These approaches have 
made great progress in developing compact 
accelerator stages at low energy, but the com- 
bined use of such stages to reach high energies 
while retaining a high beam quality will require 
many years of research and development. Still 
other plans involve muon beams’. 

Thanks to the MICE collaboration, the first 
demonstration of ionization cooling of muons 
has been achieved. However, it must be noted 
thatthe amount of cooling was small. Although 
conceptual designs for muon colliders have 
been developed’, establishing the viability of 
arealistic muon-cooling system and of amuon 
collider will need much more work. 

It is too soon to say which, if any, of the pro- 
posed approaches will provide a technically 
and financially feasible path to the future 
energy frontier. But if physicists can learn how 
tocooland control muon beams, thenit is hard 
to imagine that putting muons ina circular col- 
lider will not be the way forward. These parti- 
cles offer clean collisions (unlike protons) and 
lose little energy when their trajectories are 
bent by accelerator magnets (unlike electrons). 
As aresult, a muon collider could reach ener- 
gies that match or surpass those of an electron 
or proton collider, but be substantially smaller. 
The MICE collaboration’s work is a milestone 
onthe road to realistic muon-cooling systems 
that could someday lead to neutrino factories 
and muon colliders. 
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Climate science 


Early models successfully 
predicted global warming 


Jennifer E. Kay 


Climate models published between 1970 and 2007 provided 
accurate forecasts of subsequently observed global surface 
warming. This finding shows the value of using global 
observations to vet climate models as the planet warms. 


Climate models are equations that describe 
climatically relevant processes and are solved 
onsupercomputers. In addition to being inval- 
uable tools for testing scientific hypotheses, 
these models have long provided societally 
important forecasts. The first climate models 
to numerically describe an evolving and inter- 
acting atmosphere, ocean and land surface on 
a grid covering the entire Earth date back to 
the 1970s (for example, refs 1-3). Since then, 
the planet’s surface has warmed, in large part 
because of increased emissions of greenhouse 
gases. Writing in Geophysical Research Letters, 
Hausfather et al.‘ retrospectively assessed the 
forecasting skill of climate models published 


between 1970 and 2007. Their results show 
that the physics in these early models was 
accurate in predicting subsequently observed 
global surface warming. 

A key point emphasized by the authors is 
that the forecasting ability of climate models is 
limited by unknowable future climate drivers. 
Many major drivers, such as increased concen- 
trations of carbon dioxide in the atmosphere 
caused by the burning of fossil fuels, result 
from human activities and decisions. Early cli- 
mate modellers included estimates for future 
climate drivers in their forecasts. However, 
they could not know, for example, how the 
world would industrialize or the associated 


Figure 1| A Univac 1108 computer, from 1972. Hausfather et al.* demonstrate that climate models 
published over the past five decades accurately predicted subsequently observed changes in Earth’s global 
mean surface temperature. These models include ones reported in the 1970s that used supercomputers, 
such as the Univac 1108, that had extremely limited power relative to those used today. 
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emissions of CO, that would result. 

Hausfather and colleagues developed a 
method for evaluating the forecasts of early 
climate models without penalizing the models 
for their inaccurate estimates of unknowable 
future climate drivers. The authors examined 
17 projections of global mean surface temper- 
ature (GMST) from 14 models. Before applying 
their method, they found that 10 projections 
were consistent with observations. But when 
inaccuracies in the estimates of climate driv- 
ers were taken into account, the authors 
discovered that 14 projections agreed with the 
data. Of the three that did not, two predicted 
higher-than-observed surface warming and 
one predicted lower-than-observed warming. 

Developing credible climate models 
through an understanding of climatically 
relevant processes, observations and 
well-formulated equations is a considerable 
scientific and computational challenge. The 
equations that describe climate are complex 
and require substantial computing power to 
solve. As a result, climate models have always 
been run onthe fastest supercomputers avail- 
able. It is especially impressive that the earliest 
models assessed by Hausfather et al. produced 
accurate GMST forecasts, given the extremely 
limited computing power available then com- 
pared with that used today (Fig. 1). 

Although the authors’ findings show 
that climate models can accurately predict 
GMST, these forecasts are insufficient for 
understanding and preparing for the effects 
of ongoing climate change. For instance, 
regional climate change is especially subject 
to unpredictable climate variability, which 
greatly limits forecasting potential —evenon 
decadal timescales when the climate drivers 
are known’. Moreover, on the basis of GMST 
forecasts alone, itis hard to predict, for exam- 
ple: to what extent sea level will rise; how ocean 
acidification caused by uptake of atmospheric 
CO, will influence marine ecosystems; and 
the frequency and magnitude of future fires, 
droughts and floods. 

Scientists will have to continue to improve 
climate modelling and to increase their under- 
standing of the effects of climate change, while 
keeping in mind the tension between the need 
for increased model resolution, greater rep- 
resentation of climatically relevant processes, 
and more simulations to characterize unpre- 
dictable climate variability. The successful 
forecasting of GMST by early climate models 
isimpressive, but leaves much work to be done 
—asscientists, policymakers and stakeholders 
are all well aware. 

Numerical models based on scientific equa- 
tions describing the atmosphere are used daily 
to make decisions that save lives and money. 
As the climate continues to change owing 
largely to human activities, scientists need 
to use, improve and communicate the value 
of numerical models and the equations and 
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knowledge that underlie them. Hausfather 
and colleagues’ work demonstrates that the 
physics in climate models has been providing 
accurate forecasts of GMST under increasing 
amounts of atmospheric CO, for decades. 
Such predictions are useful for estimating the 
maximum amount of CO, that can be released 
into the atmosphere over time to keep surface 
warming to a specified level. 

Crucially, the authors’ results also show 
that a major source of uncertainty in GMST 
forecasts comes from climate drivers. And, 
of these drivers, it is emissions of greenhouse 
gases from human activity that will largely 
determine future surface warming. The find- 
ings indicate the usefulness of climate-model 
predictions of GMST in response to increasing 
greenhouse-gas emissions, despite unknowa- 
ble future climate drivers. But scientists must 


Cancer 


also continue to develop climate models in 
concert with everything else availableto them, 
to plan for a changed climate that requires 
much more than forecasts of surface warming. 
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Brain tumours manipulate 
neighbouring synapses 


Nicola J. Allen 


The growth of a brain tumour can be affected by the activity of 
its neighbouring neurons. The finding that such tumours send 
signals that boost connections between these neurons reveals 
a pathway that drives cancer growth. See p.166 


A type of non-neuronal brain cell called a 
glial cell can give rise to a lethal cancer called 
glioblastoma’. Half of the cells in the human 
brain are glial cells, which normally act to 
support the function and communication of 
neurons’. Yet despite decades of research, 
there are no existing treatments for glio- 
blastoma that substantially increase the 
survival time of people with such tumours. 
On page 166, Yu et al.’ report their analysis 
of the effects on the brain of certain glioblas- 
toma-associated mutations. These insights 
might open up new strategies for anticancer 
research. 

DNA sequencing of cancers has identified 
many tumour-associated mutations. How- 
ever, it is a challenge to determine which of 
these mutations have a causal role in tumour 
development and growth, and which have no 
effect. Moreover, some mutations can be con- 
text dependent, such that the same mutation 
might have differing effects depending onthe 
type of tumour and its microenvironment. It 
is hard to predict whether different muta- 
tions (variants) of the same gene will lead to 
the same outcome in different tumour types. 

Yu and colleagues tackled these issues 
by studying the RTK-RAS-PI3K signalling 
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pathway in glioblastoma. This pathway is 
altered in 90% of glioblastomas*”, and muta- 
tions in it boost cell division and tumour 
growth. The authors focused on mutations 
in the gene encoding the enzyme PIK3CA — a 
pathway componentthatis often abnormalin 
human glioblastomas. 

The authors generated mouse models of 
glioblastoma, and mutated genes in the RTK- 
RAS-PI3K pathway using the gene-editing 
tool CRISPR-Cas9, which resulted in tumour 
growth. Yu et al. then engineered animals to 
express PIK3CA variants that are found in 
human glioblastomas (Fig. 1). This revealed 
that many of the tested variants made tumours 
more aggressive and rapidly lethal. This was 
the case both for known variants and for others 
that had not previously been associated with 
arole in glioblastoma. 

Yu and colleagues studied whether alter- 
ations in the enzymatic activity of PIK3CA 
variants might explain how they accelerate 
glioblastoma progression. Surprisingly, the 
activity of PIK3CA was not always linked to 
an effect on tumour growth — some variants 
strongly increased enzyme activity, whereas 
others had amuch milder effect. To investigate 
other mechanisms that might explain the effect 


ontumour growth, the authors performed RNA 
sequencing of tumour cells. Most PIK3CA var- 
iants had patterns of altered gene expression 
that were similar to each other in the animals’ 
tumours, but different from the pattern in 
healthy tissue. However, two variants (named 
C420R and H1047R) had distinctive patterns 
compared with each other and with the other 
variants. Tumours containing either of these 
two variants showed abnormally altered 
expression of hundreds of genes that regulate 
synapses (structures that connect neurons). 
C420R and H1047R were also each associated 
with alterations in different categories of 
synapse-regulating gene. 

Seizures are caused by a rise in neuronal 
activity and are often an early symptom of 
glioblastoma®. This effect might be driven by 
an abnormal increase in the connectivity of 
excitatory synapses (those that drive neuronal 
activity) anda decrease inthe connectivity of 
inhibitory synapses (which decrease neuronal 
activity). Yu and colleagues report that mice 
with tumours that express C420R or H1047R 
variants had seizures, whereas a PIK3CA 
variant termed R88Q that does not alter syn- 
apse-related gene expression in tumours did 
not induce seizures. Compared with tumours 
expressing the R88Q variant, those expressing 
C420R or H1047R showed an increase in the 
number of excitatory synapses and a decrease 
inthe number of inhibitory synapses in tissue 
surrounding the mouse tumours. This pattern 
might explain how certain PIK3CA variants 
drive neuronal excitability and seizures. 

When the authors engineered mouse brains 
to express increased amounts of the PIK3CA 
variants (without expressing other mutations 
that drive glioblastoma), they found that, in 
the absence of glioblastoma, H1047R drives 
seizures whereas C420R and R88Q do not. This 
indicates that H1047R acts ‘cell autonomously’ 
to regulate neuronal excitability — meaning 
that it functions within the cell itself rather 
than acting on a neighbouring cell. 

To determine how C420R might drive 
seizures, the authors expressed PIK3CA var- 
iants ina type of glial cell called an astrocyte 
that enhances the formation of synapses 
between neurons by releasing secreted 
molecules. The authors cultured astrocytes 
in vitro with neurons and assessed the astro- 
cytes’ ability to induce neuronal synapse 
formation. Astrocytes that expressed C420R 
induced higher than normal synaptic forma- 
tion between neurons, whereas astrocytes that 
expressed H1047R had no such effect. This 
suggests that C420R changes the properties 
of glioblastoma cells to make them induce the 
formation of synapses between the neurons 
that they contact. 

Further analysis of RNA sequencing data 
by the authors revealed that tumour cells 
that expressed C420R showed increased 
expression of known synapse-regulating 
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Figure 1 | Tumour modulation of neuronal connections. Yu et al.’ report their analysis of the brain cancer 
glioblastoma, which arises from the growth of non-neuronal (glial) cells. Using mouse models, the authors 
analysed mutations found in people who have glioblastoma. They report that a mutant form of the protein 
PIK3CA — the C420R variant — is associated with expression of genes in the cancer cell that can regulate 

the synaptic connections between neurons. One such gene encodes the protein glypican 3 (GPC3), which is 
secreted by cells and can boost synapse formation (synapses shown in pink). This leads to a rise in synaptic 
activity, which was associated with tumour progression. Enhanced neuronal activity might explain why 


seizures are associated with glioblastoma. 


factors that are secreted by astrocytes. 
These included members of the glypican 
family (part of a group of sugar-containing 
proteins called proteoglycans), which can 
induce the formation of excitatory synapses 
inthe brain’. 

The expression of one member of this 
family, glypican 3, was particularly highly 
upregulated in tumour cells that expressed 
C420R. When the authors engineered glio- 
blastomas expressing C420R to lack glypi- 
can 3, this caused a decrease in seizures and 
a longer lifespan compared with mice that 
had C420R-expressing glioblastomas that 
expressed glypican 3. By contrast, animals 
with glioblastomas engineered to have higher 
expression of glypican 3 had more seizures and 
a decreased lifespan compared with animals 
with glioblastomas that did not overexpress 
glypican 3. This suggests that glypican 3 is 
necessary and sufficient in the context of glio- 
blastoma to regulate seizures by controlling 
synapse formation in the neuronal tissue 
around the tumour. 

Yu and colleagues’ work supports grow- 
ing evidence that tumour cells interact with 
their neighbouring healthy cells to alter brain 
function. Recent work has revealed that cells 
of the glioblastoma itself can form a synapse 
with surrounding neurons (in such synapses, 
the tumour has postsynaptic structures and 
the neuron forms presynaptic structures), 
and that blocking this synaptic input to the 
glioblastoma decreases tumour growth®”’. 
Yu et al. now reveal how a glioblastoma can 
remodel connections between its neighbour- 
ing neurons, and report that the underlying 
mechanism differs depending onthe specific 
PIK3CA variant involved. It will be interest- 
ing to determine in future studies whether 
the same synapse-promoting signals are 
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responsible for regulating the synapses that 
form betweena tumour and its neighbouring 
neurons, and for regulating synapses that form 
between neurons surrounding the tumour. 
This is an intriguing matter, because glypi- 
cans regulate neuronal synapse formation 
through what are known as AMPA receptors, 
and the synapses that form between neurons 
and glioblastoma cells use AMPA receptors 
for signalling®’. 

Abnormally high expression of glypican 
family members occurs in various types 
of tumour, including liver and pancreatic 
cancer" ”, In those cases, glypicans are 
thought to bind to and regulate the signal- 
ling of growth factors that promote tumour 
growth. Glypicans are clearly of interest in try- 
ing to understand many disorders, and efforts 
being made to block their function in other 
cancers might suggest approaches worth 
testing for use in glioblastoma treatments”. 
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Solar heating of acometary surface provides the energy necessary to sustain gaseous 
activity, through which dust is removed’”. In this dynamical environment, both the 
coma** and the nucleus* evolve during the orbit, changing their physical and 
compositional properties. The environment around an active nucleus is populated by 
dust grains with complex and variegated shapes’, lifted and diffused by gases freed 
from the sublimation of surface ices®”. The visible colour of dust particles is highly 
variable: carbonaceous organic material-rich grains” appear red while magnesium 
silicate-rich"’” and water-ice-rich’** grains appear blue, with some dependence on 
grain size distribution, viewing geometry, activity level and comet family type. We 
know that local colour changes are associated with grain size variations, such as inthe 
bluer jets made of submicrometre grains on comet Hale-Bopp’* or in the fragmented 
grains in the coma” of C/1999 S4 (LINEAR). Apart from grain size, composition also 
influences the coma’s colour response, because transparent volatiles canintroducea 
substantial blueing in scattered light, as observed in the dust particles ejected after 
the collision of the Deep Impact probe with comet 9P/Tempel 1”. Here we report 
observations of two opposite seasonal colour cycles in the coma and on the surface of 
comet 67P/Churyumov-Gerasimenko through its perihelion passage"®. Spectral 
analysis indicates an enrichment of submicrometre grains made of organic material 
and amorphous carbon in the coma, causing reddening during the passage. At the 
same time, the progressive removal of dust from the nucleus causes the exposure of 
more pristine and bluish icy layers on the surface. Far from the Sun, we find that the 
abundance of water ice on the nucleus is reduced owing to redeposition of dust and 
dehydration of the surface layer while the coma becomes less red. 


Understanding how comets work and evolve is one of the most compel- 
ling questions to which the Rosetta mission’ has been trying to find an 
answer. Unlike past fly-by missions to comets, Rosetta was designed 
to enter the same orbit and accompany comet 67P/Churyumov-Ger- 
asimenko (hereafter 67P) through its perihelion passage, giving us 
the unique opportunity to follow the colour changes developing on 
acomet nucleus and coma in its active phase through the inner Solar 
System. So far, several studies have investigated how dust and gas 
(H,O, CO,) production? are correlated during one cometary rotation, 
showing that the water-ice sublimation on the surface of 67P is the 
driving mechanism for dust ejection in the coma”. In fact, the dust 
emission flux appears lower above high-cohesion consolidated terrains 
and reaches a maximum when the subsolar direction is aligned above 
volatile-rich areas. As the illumination conditions continuously change 
on 67P, rather than focusing on coma and nucleus colour variations 
occurring during a few diurnal rotations, we analysed the entire dataset 


returned by the Visible, Infrared and Thermal Imaging Spectrometer 
(VIRTIS)” on Rosetta. Our method allows exploration of the comet’s 
colour evolution starting from January 2015 (inbound orbit, heliocen- 
tric distance 2.55 AU), encompassing perihelion passage (August 2015, 
1.24 AU) until May 2016 (outbound orbit, 2.92 Au). Spectral indicatorsin 
the visible range, including the integrated radiance (/), the wavelength 
of the radiance peak (A,,,,,) and the spectral slopes (in the 0.4-0.5 and 
0.5-0.8 pm ranges), are synergistically used to model the composition 
and grain size distribution of the dust particles in the coma as a func- 
tion of heliocentric distance. The spectral indicators are derived for 
each spectrum acquired by VIRTIS in an annulus encircling the nucleus 
and comprising all the pixels inthe coma located at a tangent altitude 
between 1.0 and 2.5 km above the nucleus’ limb. At the same time, we 
monitor the colour changes occurring on 12 test areas on the nucleus 
through the 0.5-0.8 pm spectral slope*®. The description of the dataset, 
including an example of VIRTIS images and spectra after calibration 


'INAF-IAPS, Institute for Space Astrophysics and Planetology, Rome, Italy. 7LESIA, Observatoire de Paris, Université PSL, CNRS, Sorbonne Université, Université Paris Diderot Sorbonne Paris Cité, 
Meudon, France. “German Aerospace Center (DLR), Institute of Planetary Research, Berlin, Germany. “INAF—Osservatorio Astronomico di Capodimonte, Naples, Italy. *e-mail: gianrico. 


filacchione@inaf.it 


Nature | Vol578 | 6 February 2020 | 49 


Article 


a b 
ZT 50 a 2215 : - Fig. 1| Time series of the spectral 
v 8 4 Ee y ‘ ae properties of 67P coma dust. 
y ao? = Pa io 
E 40E 0 £ee 2 = ike a a, Integrated radiance (/): full 
= oe 4 E «— fE Sa i 
a aa ‘B A 5 = = “5 dataset, partial annulus coverage 
§ 30 i . rine 0 3 i (black points); reduced dataset, 
8 5h vooeg csi % 3 a complete annulus coverage (red 
3 oh 3.93 i 3 ? points; see discussionin Methods). 
7 ee . xe) 
$ 10 F 1B H cA , 2 Ofs b, The 0.4-0.5 um spectral slope. 
8 ‘. i are, e : 2 [ c, Visible radiance wavelength peak 
> obser! Ur ee : : 7 6S Amnax- d, The 0.5—0.8 xm spectral 
E 3 3 8 8 8 3 3 8 Pd 3 3 33 2 Sy 3 8 Pa slope. Throughout the paper, 
PN a FE ee oe OFS EN RS ES eM spectral slopes are givenin pm” 
c G60 3 (u) d 10 - corresponding to10%/100 nm colour 
& poe t E 7 Gee : slope. Rosetta’s MTP intervals are 
% ries = are fasts marked by vertical lines between 
2 0.55 tz € 6Le fesacas ae MTPO12 and MTPO28. Time intervals 
> [S, a rc 2 tie, . ‘ and heliocentric distance (D,) of MTP 
o if - oe Z 46 Ae ee periodsare listed in Extended Data 
6 0.50} ary Se bt ae tes Stee, aussi i 
2 L al ght @ of Tt Oe Table 1. Perihelion occurs during 
oo eons Q ‘ A 
2 : § ; a 2e5 é rere Pe MTPO19. Dates are displayed as year- 
3 0.45; Mie! a , 3 OFF Hae 5 fe i or month-day. 
rs : .B fhe ue gue ee 
ra ; : , | > OCs f : tee ee Seles 
> To ee OP SP A Qe ee Bae Te eS PS Pe NN ee FS ae 
TA MTN DORWADOrA Tr aw Tt rN MTN DOOR DADOYTN raAvYM Tt 
PPP OOOO OO On TT 2 Pe See eooOo °F 7 eT PF ee 
owonwnnnnnnnnnwnn © OO O onounwnounwnunnnnnnnnn © OO O 
SSSSCOSCFSOSCSFOSCFO OOS SSOCOCFOCSOSFOOFSSFOOO FOOD 
NNNNNNNNNNNNN NNN CO NNNNNNNNNNNNN NNN WS 
Date Date 
Hatmeit—Bastet, lon = 2.5°, lat = +4.5° Aker, lon = 54.0°, lat = -15.5° Babi, lon = 89.5°, lat = +45.5° Imhotep, lon = 137.5°, lat = -12.0° 
3.0 3.0 3.0 3.0 
Ea _— E Z 
B25) 5 ote = 25} = 25f & 25} 
a fas : rod 28 a 3 eee 0 
5 e e § eo @ iS) ee rs) ee 
® 2.0 : ° ® 2.0;0 .° D 2.0bee Seeee Oy ® 2.0t6 e 
E E . E > a g * Jo 000? 
= * = e ° = aa e ° 
e 
S 1.5) of S 1.55 e S 1.57 a 1.51 
b b oO b 
So =} fo} fo} 
1.0 f 1 1 f . 1.0 . f . 1 L 1.0 1 f f 1 1 1.0 A 1 A A 1 
5 10 15 20 25 30 5 10 15 20 25 30 5 10 15 20 25 30 5 10 15 20 25 30 
MTP MTP MTP MTP 
Imhotep, lon = 139.5°, lat = -16.5° Ash, lon = 183.0°, lat = +30.0° Ash-Apis, lon = 203.0°, lat = +15.5° Anubis, lon = 242.0°, lat = -31.0° 
3.07 3.07 3.0 3.0 
2 E . 2 
53 25t 2 25 5 25[ = 25} 
ca) ee oD e © © 
& "'s S ° eve & i} fee % 
z 2.0 © e @G 2.0 oe e o® : 2.0 te? % eo,e = 2.0 le e 
@ e@ e 
=a e ° - 7 = eo e 5 °e., — = eee é 
(oe) (oe) (oo) e 
5 61.5} e 2 41.5+ 5 «1.5 e 5 1.55 
c 9 ° f : a 
tt Te) pe my 
lo) o lo) lo) 
1.0 1 1 1 1 1 1.0 fl 1 1 1 L 1.0 1 1 fl fl 1 1.0 1 1 1 1 1 
5 10 15 20 25 30 5 10 15 20 25 30 5 10 15 20 25 30 5 10 15 20 25 30 
MTP MTP MTP MTP 
Anubis—Atum, lon = 248.5°, lat = -34.5°  Hapi-Anuket, lon = 268.5°, lat = -9.0° Serget, lon = 313.0°, lat = +3.5° Hatmeit, lon = 350.5°, lat = +6.0° 
3.0 : 3.0 i 3.0 : 3.05 : 
a a a a 
a 2.5F a 2.5} a 2.55 ee a 25+ 
o o ® ee** co) © © 0 
Q ®o00 Q e Q ) Qa e e 
2 2 ceo Te) ‘eae 2 e e 2 bd 
% 2.0 ° ® 2.0} oo e 0 2.0/% & |, ® 2.0 fe® oe 
= e e € e e s E€ . e e ee 
= e = e a a 
(oe) ioe} e © (ce) 
5 «1.54 5 1.54 5 1.54 5 1.54 
z le gos vi 7 
iQ iQ ae) iQ 
oO oO oO oO 
iE 6s reek eee eee ir ae Oe sr 6) seer rere ver coro OO FO Ean rere rrr Pe [© Ere tre revere revere mere 
5 10 15 20 25 30 5 10 15 20 25 30 5 10 15 20 25 30 5 10 15 20 25 30 
MTP MTP MTP MTP 


Fig. 2| Time evolution of 67P nucleus colour measured through the 0.5- 
0.8 pm spectral slope above 12 control areas. Perihelion passage occurs 
during MTPO19 and is marked by the vertical dashed line. The morphological 
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regionnameand position (longitude, lon, and latitude, lat) of each area are 
reported above each plot. The maximum error associated with the slope 
measurements is of the order of +0.1um"™. 
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Fig. 3 | Simulations of the A,,,,,as a function of the scattering angle for 
spherical particles of different composition. a, Amorphous carbon. 

b, Magnesium silicate. c, Organic ice tholin (A,,,, > 0.65 pm for 0.10-0.99-tm- 
radius grains). d, Troilite. e, Water ice. f, 67P observations by VIRTIS. Median 


improvement, spectral indicator processing and data modelling, are 
given in Methods. 

From a synergistic time analysis of the coma and of the nucleus sur- 
face spectral properties, two opposite trends with heliocentric distance 
are observed. Coma integrated radiance increases (see /time series in 
Fig. 1a) and colour becomes redder when the comet approaches peri- 
helion (see time series of A,,,, and the 0.4—0.5 and 0.5-0.8 pm spectral 
slopes in Fig. lb—d). Conversely, on 12 control areas of the nucleus, for 
which we have continuous time coverage during the entire Rosetta 
mission, we observe a systematic blueing of the surface (see 0.5-0.8 
pm spectral slope time series in Fig. 2), with a reduction of the spectral 
slope up to 50% measured between the passage of 67P through the 
frost line (heliocentric distance 2.7 AU, occurring in October 2014 dur- 
ing inbound orbit and in April 2016 on the outbound) and perihelion 
(1.25 AU, August 2015). During the pre-perihelion period, we interpret 
the coma colour change as being the consequence of the progressive 
loss of the ice fraction in the dust grains ejected from the nucleus”°, 
which makes them redder. 

When the comet approaches the Sun, the dust grains lifted from the 
nucleus’ southern hemisphere—the one illuminated at that time—are 
warmer and become more dehydrated. Simulations based on Mie scat- 
tering theory applied to spherical grains of radius between 0.1and50 ym 
and five different compositions (see Methods for a discussion about the 
scattering model limitations induced by spherical shaped grains) are used 
tomodel VIRTIS spectralindicators, for example, A,,,,and the 0.5-0.8 pm 
spectral slope. Simulation results (Figs. 3, 4) are compatible with the 
presence of submicrometre grains made of organic matter and carbon 
during perihelion passage, which could be the result of the fragmentation 
ofthe mineral and organic matrix embedded in larger grains. During the 
outbound orbit, the simulations show that the organic material contin- 
ues to be present in the coma together with an increasing percentage 
of different grains (water ice or magnesium silicate, both of which are 
not fully compatible with the spectral indicator trends (Extended Data 
Table 3)), contributing to the progressive blueing observed in the data. In 
this orbital phase, the Sun shines again on the northern hemisphere, from 
wherea greater flux of surficial water-ice-rich grains”is lifted inthe coma. 


data computed 0n10° bin scattering angles (8) during different mission 
phases. Standard deviation computed on VIRTIS data are of the order of 
0.03 um. 


Conversely, the nucleus’ surface becomes progressively bluer during 
the inbound orbit because the greater solar input causes anincreasein 
the gaseous activity. This triggers dust ejection with consequent ero- 
sion of the surface: on average, a 0.5-m-thick erosion layer is lost during 
each orbit”. As aresult of this process, more pristine subsurface layers 
enriched in water ice are exposed on the surface, which turns bluer**””. 
We have evidence’, in fact, that water-ice-rich layers are immediately 
available under the dust layer, as shown in areas of recent disruption 
of the surface” and as demonstrated by many thermal models applied 
to cometary nuclei**”*. Between 20% (ref. “) and 50% (ref. *°) of the 
dust grains ejected around perihelion from the south pole—the region 
that receives the maximum insolation at that time—fall back prefer- 
entially on the northern hemisphere. The fall-back flux is dominated 
by decimetre-size dust aggregates in which a small fraction of water 
ice can be maintained”. After perihelion, as soon as activity begins 
to settle, the progressive accumulation of dust on the surface covers 
the exposed pristine layers again, making the nucleus surface redder 
again. This process explains the colour cycle observed by VIRTIS on 
the nucleus (Fig. 2). 

The colour of the 67P coma particles can be modelled with different 
grain populations made of water ice before perihelion, organic mate- 
rial and carbon at perihelion, and water ice or possibly magnesium 
silicate after perihelion. Similar composition endmembers are com- 
patible with the findings reported by previous Rosetta studies: during 
outbursts, the release of submicrometre-size water-ice grains mixed 
with hundred-micrometre-size refractory grains has been observed”. 
At the same time, a broad emission feature between 3.4 and 3.6 pm has 
been detected by VIRTIS”®, which is compatible with the sublimation 
of organic materials caused by the high temperatures of the grains. 
The colour reduction observed when the comet was far from the Sun 
resembles the spectral properties of water-ice grains, whose existence 
in the lower coma has been confirmed by other investigations: apart 
from the presence of submicrometre-size water-ice grains ejected 
during outbursts”, thermal models and Optical, Spectroscopic, and 
Infrared Remote Imaging System (OSIRIS) cameraimages” show that 
at 1.53 AU heliocentric distance, the total sublimation of ice for grains of 


Nature | Vol578 | 6 February 2020 | 51 


Article 


a Amorphous carbon b Troilite 
= ot Grain radius 0.10-0.99 um ~ 6b 
5 Grain radius 1.0-9.9 um 5 
Oo Grain radius 10-50 um oO 
Qo 4} Q 4- 
2 2 
no o 
1 ee E of 
fo) a fo) Pe tal oe en ce 
4 oe 8 : ee © & & 8 8 8 
Lie ee Pee eae ee ROP 
fo) fo) 
60 80 100 120 140 160 60 80 100 120 140 160 
6(°) 9(°) 
c Magnesium silicate d Water ice 
— — 
& 4t 2 at 
2 * 2 
no o 
& 2t .- & at 
o 7 Pee et oe) ae ee 
fo} a a a ee o mo ae gs ee : 
ob OF 8 FS Se ee, : * 6 Of ts : . 13 
fo} 1 1 fo) cl ‘wer’ 
60 80 100 120 140 160 60 80 100 120 140 160 
9(’) 9 (°) 
e Organics f VIRTIS 
=~ 6b . ~ 6b MTP015 
£ € MTP016 
= a MIP020 
@ 4 6 gy MTP021 
2 47 Sore MTP027 
G f a 7) —. MTP028 
E of. °% © . . € of * * 7 ey : ew 
0 “8 od . . 
° ey e . : e 
3 of ret PF © ws d 10: Be cs ~ abe ake a OSE ca, OW, Re Se. 
Oo ¥. & 1 n Oo 
60 80 100 120 140 160 60 80 100 120 140 160 


8°) 8(°) 
Fig. 4 | Simulations of the 0.5-0.8 jm spectral slope as a function of the 
scattering angle for spherical particles of different composition. a, 
Amorphous carbon. b, Troilite.c, Magnesium silicate. d, Water ice. e, Organics. 
f, Median spectral slope derived from VIRTIS observations for pre-perihelion 
(MTPO15 and MTPO16), perihelion (MTP020 and MTPO21) and post-perihelion 
(MTPO27 and MTPO28) mission phases. The standard deviations of the VIRTIS 
dataare<0.6um". 


radius between 5.and 500 pm made of ice-dust layers or mixtures occurs 
well beyond the 2.5 km distance from the 67P nucleus considered here. 
This implies that water-ice grains can populate the region of the coma 
analysed in this study. Finally, aphotometric investigation of the visible 
colours of more than 1,000 individual grains resolved on OSIRIS colour 
images of the 67P coma” indicates the presence of three composition 
classes: organic material grains (steep colour slope), mixtures of silicate 
and organic material (intermediate slope), and water ice (flat slope). 
The presence of carbon, silicates and magnesium in 67P dust grains has 
been assessed by the Cometary Secondary Ion Mass Analyser (COSIMA) 
instrument on Rosetta”. 

The fact that the coma’s and the surface’s colour spectral indica- 
tors return to the same values at the beginning and at the end of their 
time series (Fig. 1), for example, at heliocentric distances greater than 
2.7 Au, is evidence of the establishment of an orbital water-ice cycle 
driven by the solar heating. Far from the Sun, with the settling down 
of the gaseous activity’, a tenuous (/= O in Fig. 1a) coma surrounds a 
red-coloured dusty nucleus. 
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Methods 


VIRTIS dataset 

While the scientific objectives of the Rosetta mission”, including the 
study of the dust in the coma, were well-defined before the spacecraft 
reached 67P, the detailed planning of the scientific payload opera- 
tions has been a complex and continuously evolving process due to 
the uncertainties of the cometary environment in which the mission 
had to operate. A detailed overview of the Rosetta science planning 
process and challenges is given in ref. *’. In this context, VIRTIS coma 
observations were performed from very different viewing and illumi- 
nation geometries, spacecraft ranges from the nucleus, heliocentric 
distances and cometary activity levels, resulting in a dataset showing 
avery variable spatial resolution and signal-to-noise ratio. 

While VIRTIS-led targeted observations of the coma have been 
routinely executed, a large fraction of the VIRTIS dataset consisted of 
ride-along and serendipitous acquisitions that were acquired while 
the spacecraft attitude and pointing were controlled by other instru- 
ments. For this reason, to maximize the retrieval of information about 
the dust particles properties, a statistical approach has been applied to 
the vast VIRTIS dataset with the scope to exploit it in its completeness. 

We restrict our analysis to only the VIRTIS-M (imaging channel) vis- 
ible data covering the 0.25-1.0 pm spectral range with 432 bands anda 
spectral sampling of less than 2 nm per band. A detailed description of 
the VIRTIS experiment is provided by ref.”. In contrast to the VIRTIS-M 
infrared channel operating within the 1-5 um spectral range, which 
ceased operation in May 2015 due to a permanent failure of the cryo- 
cooler cooling of the infrared channel detector, the visible channel has 
operated flawlessly for the entire duration of the mission. 

For this study, we processed coma data collected during Rosetta’s 
MTPO12-MTPO28 periods. The time intervals of each medium-term 
phase (MTP) are listed in Extended Data Table 1. The dataset consists of 
more than 4,500 hyperspectral cubes of the coma of 67P acquired froma 
range of distances from the nucleus between 27.8 and 1,462.2 km, result- 
ing ina spatial resolution between 7 and 365 m per pixel, respectively. 

Each VIRTIS observation has been calibrated in spectral radiance” 
and specific geometry parameters have been calculated for each pixel 
falling on the coma region and the nucleus. Further details about the 
VIRTIS data mapping methods are given in ref. °. The nucleus solar 
phase and the spacecraft position and distance in the Cartesian frame 
centred on the nucleus have been calculated using SPICE kernels”. 

The spectral parameters used in our analysis to study the coma emis- 
sion are calculated in an annulus defined by a tangent altitude running 
between d,,,,=1km and @,,,,=2.5 km around the nucleus. This criterion 
is adopted independently from the nucleus-spacecraft distance andthe 
solar phase angle at the time of the observation. The selection of pixels 
placed at distances larger than 1km allows (1) reduction of the contribu- 
tion of the instrumental stray light coming from the illuminated part of 
the nucleus and (2) maximization of the coverage across the dataset. 

An example of a VIRTIS-M visible image of the coma showing the 
annulus definition is given in Extended Data Fig. 1 (top panels). This par- 
ticular image refers to observation MTPO19 - STP068 - V1_00397724286 
acquired on 9 August 2015 froma distance of 304 km, resulting in aspa- 
tial resolution of 75 m per pixel with an integration time of 16s per line. 

Sucha viewing geometry is not very common because the complete- 
ness of the annulus over the 360° azimuth (up to distance a,,,,) is limited 
by the instrumental field of view (FOV = 3.6°), by the relative distance 
(d) between the Rosetta spacecraft and the comet’s surface, and by the 
spacecraft off-nadir pointing direction (a). Only when the following 
condition is verified, VIRTIS-M can acquire the full annulus: 


+ 
at aretan( <™=*" <FOV (1) 


wherer=1.74 kmis the average radius of the nucleus™. A similar condi- 
tionis verified in only 248 observations (marked as red points in Fig. 1a), 


corresponding to about 5% of the dataset: as a consequence, integrated 
radiance and maximum emission wavelength values could be biased 
by the incompleteness of the annulus on the remaining observations. 
The average integrated radiance on incomplete annulus observations 
(black points) appears, in general, higher than on the ones where the 
annulus is entirely acquired (red points). This happens because VIRTIS 
is preferentially observing the coma towards the solar direction to 
maximize the signal-to-noise ratio. Apart from the above limitations, 
which introduce some scattering in the data points, the value of the 
integrated radiance is mainly driven by the number of scatterers (dust 
grains) along the line of sight, rather than by their composition and 
grain size distribution (which the spectral slopes and the spectral posi- 
tion of the radiance peak depend on). 

The overall coma brightness and spectral properties measured by 
VIRTIS are driven by light scattering on dust particles and are not altered 
by emissions from gas molecules in the coma. The instrument sensitiv- 
ity at visible wavelengths, in fact, is not enough to measure gaseous 
emissions from CN, C, and C,. 


Calibration update using stellar observations 
With the aim of improving the VIRTIS-M visible response in the 0.25- 
0.45 um spectral range, where the standard pipeline suffers large uncer- 
tainties due to stray light introduced by the on-ground calibration setup, 
we applied a radiometric correction based on stellar observations. The 
average signal derived from Vega (a Lyrae) observations listed in Extended 
Data Table 2 is used as a reference to correct the standard responsivity. 
Four observations of Vega acquired with the longest possible integra- 
tion time, 50s, were used to retrieve the raw signal above the noise level 
(about +2 data numbers (DN)) estimated on the nearby deep-sky pixels. 
Before this step, each observation was dark-level-subtracted and 
despiked (Extended Data Fig. 2 (top-left panel), showing the signal and 
sky level for one of the four Vega observations). Owing to the instrumen- 
tal point spread function and spectral tilt, both described in detail in 
ref.*®, the Vega spectral signal is distributed across 18 spatial pixels and 
is averaged above these. The four observations were processed in the 
same way before being further averaged to improve the signal-to-noise 
ratio. The resulting average Vega signal is shown in Extended Data Fig. 2 
(top-right panel). The fractional deviation of the single spectra from the 
average is about 0.05. The Vega flux in ref. *° is convolved with the VIRTIS 
Gaussian response (full-width at half-maximum, 2.3 nm) and then bya 
4-nm-wide boxcar. The spectra were then placed on an absolute wave- 
length scale using the centre of Balmer and Paschen intense absorption 
lines as references and assuming a constant dispersion. The spectral 
dispersion (in nm) applied is A(b) = 232.9 + 1.88515 x b, with b ranging 
from 0to 431, corresponding to the VIRTIS-M number of spectral bands. 
The resulting relative responsivity derived as the ratio between VIRTIS 
signal and Vega flux is shown in Extended Data Fig. 2 (bottom-left panel). 
To further remove residual high-frequency noise due to the low Vega 
signal, anine-band running boxcar filter was applied. The response 
curves were normalized at the value of 58.75 at A = 0.635 pm to allow 
a direct comparison with the standard pipeline responsivity shown 
in Extended Data Fig. 2 (bottom-right panel). Within the limits of the 
low Vega signal in the blue spectral range (less than 40 DN at 0.55 pm), 
this method allows the retrieval of a better responsivity at shorter 
wavelengths with respect to the standard pipeline®. As shown by the 
ratio plot (blue curve), the standard pipeline overestimates the target 
radiance for wavelengths A < 0.4 pm. In the remaining spectral range, 
the differences are much smaller. As the wavelength of the dust emis- 
sion spectral radiance peak occurs between 0.45 and 0.55 pm, such an 
instrument-response correction allows improvement of the retrieval of 
the left wing of the radiance peak and the spectral slope determinations. 


Coma spectral indicators 
After averaging the coma signal within the annulus previously defined 
and applying the updated responsivity function, we calculated four 
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spectral indicators for each observation. The indicators are as fol- 
lows. (1) The integrated radiance (/) across the visible spectral range, 
defined as: 


1pm 
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where R(n,A) is the spectral radiance measured on the nth pixel of the 
annulus at wavelength A and Nis the total number of pixels within the 
annulus area. (2) The wavelength of maximum emission of the radiance 
(Amax) Measured ona fourth-degree fit on the average spectral radiance 
calculated on the annulus. 

After converting spectral radiance to irradiance/solar flux (I/F), 
we calculated the two spectral slopes following ref. °. These are: (3) 
0.4-0.5 um spectral slope on //F; and (4) 0.5-0.8 pm spectral slope on 
1/F. The two spectral slopes are calculated as the angular coefficient 
of the best linear fit inthe ranges 0.4—0.5 and 0.5-0.8 pm after having 
normalized //F at 0.5 um. The determination of the spectral indicators 
follows the scheme shown in Extended Data Fig. 1 (bottom panels). 

Spectral indicators depend on illumination geometries and the 
spacecraft-nucleus distance: as a consequence of the demanding mis- 
sion scenario, complex orbits, including terminator, icosahedron, 
flybys and far excursions, have been flown”, resulting in a wide range 
of distances and solar phases. The correlation between the integrated 
radiance /and distance is shown in Extended Data Fig. 3, where VIRTIS-M 
measurements are co-located with Rosetta’s position and solar phase 
angle with respect to the comet nucleus along the flight trajectory. 


Temporal evolution of the coma colour 

The integrated radiance (/) time series (Fig. la) shows an increase in 
the coma’s brightness starting from January 2015 (MTPO12, heliocen- 
tric distance 2.55 AU) to perihelion passage (MTPO19, 1.24 AU, August 
2015). Owing to the characteristics of Rosetta’s orbit around the nucleus 
and the different observation strategies implemented during the mis- 
sion, the completeness of the annulus region on the VIRTIS dataset is 
reached on only alimited number of observations, marked as red points 
in Fig. 1a. For the majority of the remaining observations (black points), 
partial coverage of the annulus is achieved. Notwithstanding the rela- 
tive scatters of the data introduced by the variability of the observ- 
ing conditions, some clear trends are identified. A local maximum, 
25<1<30Wm°sr_|, is measured about one month after perihelion. 
Two additional intensity peaks are measured at the end of August and 
September 2015, when a maximum intensity of about 50 W m“sr‘is 
reached. These peaks are caused by energetic cometary activity—two 
outbursts observed by VIRTIS on 13-14 September 2015 are reported 
by refs. °°”—but appear also to be strongly correlated with the peculiar 
viewing geometry and spacecraft position with respect to the nucleus: 
in fact, at the time of these observations, Rosetta was performing far- 
nucleus excursions, orbiting at greater distances than usual from the 
nucleus, up to 450 and 1,462 km, respectively. As aconsequence of the 
increased distance from the nucleus, VIRTIS-M has recorded higher 
radiances boosted by the larger column density of dust particles along 
the line of sight and by the maximum activity occurring immediately 
after perihelion. After this period, the integrated radiance drops rap- 
idly from MTPO21 to MTPO24 where an average value of 5W m° sr ‘tis 
reached. The maximum integrated radiance measured in MTPO20 and 
MTPO21is about a factor of 10 more intense than the average value of 
5 Wm’ sr‘ measured when the comet was far from the Sun (greater 
than 2 AU) on both inbound and outbound legs of the orbit. With the 
exclusion of the two peaks, a similar asymmetric, or cusp-like trend 
with a maximum occurring about one month after perihelion, has been 
measured by other Rosetta’s instruments for the water production 
rate, which is associated with the ejection of the dust’. Apart from the 
greater column density, we have clues that a larger particle albedo 
could contribute to the increase of the integrated radiance: according 


to ref. *, the albedo of the dust particles increases up to 20% near peri- 
helion for observations acquired at a 90° solar phase angle. Similar 
albedo changes could be the result of a different composition of the 
dominant grain population in the coma along the orbit, as discussed 
in the main text. 

The time series of the wavelength of maximum emission (A,,,,) 
(Fig. 1c) shows a cusp-like trend starting from low values dispersed 
around A,,,, = 0.45 um on January 2015 (MTPO12) on the inbound leg 
of 67P trajectory toA,,,, > 0.5 um during perihelion passage (MTPO19). 
Moving along the outbound leg of the orbit, the wavelength of maxi- 
mum emission shifts progressively towards shorter wavelengths, 
reaching again A,,., = 0.45 um in mid-May 2016 (MTPO28). Whereas 
Amax and integrated radiance time series show a similar time trend, 
spectral slopes time series (Fig. 1b, d) are characterized by a different 
evolution: at the two extremes of the time series, both 0.4-0.5 pm and 
0.5-0.8 pm slopes are almost neutral whereas at perihelion they reach 
values dispersed at 5 and 3 pm“, respectively. The visible colours of 
the coma showasystematic reddening before perihelion passage and 
a progressive blueing after it. The concurrent variations occurring on 
all spectral indicator time series point to a change in the dust grain 
composition and grain size distribution with the heliocentric distance. 


Temporal evolution of the nucleus colour 

When observed at a global scale, 67P surface dust appears very dark 
(geometric visible albedo 6.2%; ref. °°) and dehydrated, with an average 
water-ice fraction of about 1% dispersed in submicrometre grains*”*®. 
The residual water-ice content, if present, on the dust grains ejected 
fromthe surface undergoes rapid sublimation once the grainis lifted”°. 
So far, infrared spectroscopic observations have confirmed the pres- 
ence of only two ices on the surface of 67P nucleus: water ice”**' and 
carbon dioxide”. 

The lowthermal capacity of the dust (700J kg?K7*) and the high solar 
flux can rapidly increase the temperature of the grains causing the 
sublimation of the volatiles. A colour temperature as high as 630 K is 
measured on dust grains during outbursts” and in the 260-320 K range 
on quiescent coma‘. Furthermore, the grains observed by Rosetta’s dust 
instruments appear dehydrated**’. This probably happens because 
the instruments aboard the spacecraft are not kept at cryogenic tem- 
peratures, allowing the dispersion of the volatile fraction of the grains 
after their collection. Besides dehydration, grain size distribution also 
plays a role in the changes of the visible colours (see ‘Modelling dust 
grain composition and size distribution in the coma’). 

During the pre-perihelion phase, the nucleus’ surface shows a pro- 
gressive blueing until one month after perihelion when the minimum 
slope is reached in MTPO20 (Fig. 2). In this phase, the gas activity 
removes the dust from the surface with greater efficiency, resulting in 
the exposure of more pristine subsurface material enriched inices**”. 
This trend is interrupted after the perihelion passage when a progres- 
sive surface reddening is observed. Along the outbound trajectory, 
the gaseous activity progressively settles, resulting in the accumula- 
tion of dehydrated (red) dust on the surface. This appears to bea gen- 
eral mechanism, occurring on the majority of the nucleus surface, as 
shown in Fig. 2, where the average 0.5-0.8 pm slope is shown for twelve 
30 m x 30-m-wide control areas at one-month time resolution. These 
control areas, selected between latitude +45.5° and —34.5°, are the 
ones that have been observed in daylight by VIRTIS during the entire 
duration of the mission, allowing us to follow the seasonal colour cycle 
developing on the nucleus. 


Dust-scattering simulations 

The simulations are computed following the Bohren—-Huffman Mie scat- 
tering code“ to calculate scattering and absorption by ahomogenous 
isotropic sphere. The method relies on the following assumptions. 
(1) Single-scattering approximation, for example, the solar photons 
interact with only one particle before reaching the observer. Sucha 


hypothesis is reasonable for the annulus region considered in this 
study, where the optical depth is small. (2) Polydisperse spherical grain 
size distributions with radii of 0.10—0.99 xm (90 bins, 0.01 pm per 
bin), 1.0-9.9 pm (90 bins, 0.1 um per bin) and 10-50 um (40 bins, 1 um 
per bin). (3) Five different grain compositions are simulated: water 
ice (optical constants from ref. *°), organic material*, troilite*, amor- 
phous carbon* and magnesium silicate*®. These endmembersare rep- 
resentative of the cometary nuclei composition, which is made up of 
a macromolecular assemblage in which various organic components 
(both aromatic and aliphatic), minerals (including silicates, iron sul- 
phides like pyrrhotite and/or troilite, and possibly ammoniated salt) 
and ices are mixed together®. (4) Spectral simulations are performed 
at 27 wavelengths within the 0.35-1.0 im spectral range. (5) Scattering 
angle 6 (=180° — g, where gis the solar phase angle) is computed at a 
step of 1° from 55° to 155° and then averaged on 4° bins to match the 
VIRTIS FOV (3.66°). 
Following ref. °°, the single scattering intensity is given by: 


1A) = SA yna(p)aq(y PE” Q) 


where S(A) is the solar flux™ scaled at the comet’s heliocentric distance, 
Ngo iS the dust column density calculated along the line of sight from 
an observer placed at distance p from the nucleus, ais the grain geo- 
metrical cross-section, q(A) is the scattering efficiency ofa single par- 
ticle and p(g, A) is the scattering phase function calculated at phase 
g. Rather than modelling the absolute value of the intensity, we limit 
here our analysis to the changes occurring in the spectral response of 
the average radiance emitted from the dust. In particular, we focus on 
the changes of the wavelength of the maximum emission (Fig. 1c) and 
0.5-0.8 pm spectral slope (Fig. 1d) observed at different heliocentric 
distances. To achieve this goal, we calculated the spectral radiance fac- 
tor, (A) expressed in arbitrary units, derived from Eq. (2) by removing 
the factors depending on the dust grain distribution and properties: 


1A, g) = StayqayPS ) 


The phase functions show scattering characteristics depending on 
composition and grain sizes: as a general rule, transparent grains, like 
water ice, are characterized by a very intense and collimated forward- 
scattering peak and complex resonance peaks at intermediate phase 
angles. Opaque materials, like troilite, show an almost isotropic scat- 
tering function. Semi-transparent particles, like ice tholins, have an 
intermediate behaviour with a prevalence of backscattering response. 
Grain size plays a fundamental role inthe scattering mechanism: in the 
Rayleigh regime (particle size much smaller than the wavelength), light 
is equally forward- and backscattered while in the geometric regime 
(particle size much larger than the wavelength), backscattering domi- 
nates. 

Fixing composition and grain size, the corresponding phase func- 
tions at the 27 visible wavelengths allow the derivation of the spectral 
radiance according to Eq. (3). 

While p(g, A) has asmooth response in the Rayleigh and in the geo- 
metric scattering regimes, the simulations, in general, show more fluc- 
tuations occurring for certain combinations of composition, grain size 
and @ angle when the grain sizes are comparable to the wavelength, 
for example, at about 1 pm. To mitigate this effect, we computed 
simulations for 25° < g < 125°, corresponding to scattering angles of 
55° < 8 < 155°, at steps of 1° and then averaged the angular response 
of p(g, A) on 4° bins, similar to the VIRTIS FOV. Similarly, the p(g) was 
computed for single grain radius (0.10-0.99 pm (90 bins, 0.01 1m per 
bin), 1.0-9.9 pm (90 bins, 0.1 um per bin) and 10-50 um (40 bins, 1 um 
per bin)) and then averaged for each of the three families. Following this 
method, the /(,g) (from Eq. (3)) was derived for each composition and 


size distribution. The A,,,, and, after conversion in //F, the 0.5—0.8 um 
spectral slope were derived from the simulated /(A,g). The correspond- 
ing values are shown in Figs. 3, 4, respectively. 


Modelling dust grain composition and size distribution inthe 
coma 

To disentangle the viewing and illumination geometry effects from 
particles’ physical properties and to provide a more quantitative assess- 
ment of the changes observed in the A,,,,, time series, the light scattering 
on grains of different composition and radius was simulated. Optical 
and insitu measurements of the dust grain distribution in the 67P coma 
by Rosetta’s instruments have revealed that the particles lie mainly 
within the 0.1 pm-1 mm diameter range™. For particles smaller than 
1mm, the dust size distribution follows a power-law distribution witha 
coefficient varying between —2 for heliocentric distances beyond 2 AU 
and -3.7 at perihelion. Conversely, large grains (diameter greater than 
1mm) showamuchsteeper distribution witha coefficient of —4 (ref. °°). 
As small grains are the predominant population, in our analysis we 
preferentially model the spectral behaviour of small (radius less than 
50 um), compact grains. A similar grain size range has been also used 
by refs. 7°” to model dust optical properties at infrared wavelengths 
and by ref. 7° to study thermal evolution and depletion of ice in dust 
aggregates along jets. The VIRTIS data are interpreted through the Mie 
theory, whichcan simulate the scattering properties of spherical grains 
of radius R and homogeneous composition having x= (2T1R/A) <1,000. 
Such a limit corresponds to a maximum grain radius R = 50 pm inthe 
visible spectral range (wavelength 0.35 <A < 1.0 um). To simulate the 
spectral response of larger grains, including fluffy aggregates”, it is 
necessary to use a geometric optics approximation®, which is beyond 
the scope of this work. 

The changes occurring in the spectral radiance scattered by spherical 
grains of different composition and grain size distribution as a function 
of the phase angle (g) are calculated for homogeneous particles made of 
possible cometary material endmembers, including water ice, organic 
ice tholin, troilite, amorphous carbon and magnesium silicate. For a 
given composition and phase angle, the single particle phase function 
p(g,A) modulates the intensity of the scattered spectral radiance. 

The theoretical trends of A,,,,. aS a function of the scattering angle 
6=180° - g for particles of different composition and three grain size 
distributions (radius 0.10-0.99 yum, 1.0-9.9 pm and 10-50 pm) con- 
sidered in this study, are shown in Fig. 3. In the same figure, we show 
the distribution of /,,,, as measured from VIRTIS observations (Fig. 3f) 
selected during six MTP periods encompassing pre-perihelion (MTPO15 
and MTPO16), perihelion (MTPO20 and MTPO21) and post-perihelion 
(MTPO27 and MTPO28) orbital phases. These periods are chosen 
because they offer the widest spread in scattering angle necessary 
to discriminate among different compositions and grain size distri- 
butions. Our analysis shows that, with the exception of transparent 
grains, for example, water ice and organic ice tholin, the response of 
opaque materials like amorphous carbon (Fig. 3a) and troilite (Fig. 3d) 
are characterized by aconstantA,,,, at about 0.5 pm for the intermediate 
(1.0-9.9 um) and large (10-50 pum) grain size distributions across the 
60 <0<150° scattering angle range explored by VIRTIS. These grains 
are therefore not matching the observed distribution of the /,,,,. Con- 
versely, opaque-material submicrometre grains show a significant shift 
of the A, towards the red range, especially for 8 < 120°. 

Submicrometre grains made of magnesium silicate (Fig. 3b) show 
a linear increasing trend of A,,,, from 0.56 pm at 8 = 60° to 0.63 um at 
@=150°, whichis not compatible with VIRTIS data. In contrast, the,,,, 
of grains with radii larger than 1m is dispersed around 0.55 pm, a value 
too red to reproduce VIRTIS data at 8 > 100° and too blue for 8< 100°. 

Water-ice grains (Fig. 3e) offer the only solution matching VIRTIS 
data taken during the pre- (MTPO15 and MTPO16) and post-perihelion 
periods (MTPO27 and MTPO28) at scattering angles 0 > 100°. During 
these phases, VIRTIS data are dispersed at A,,,,, < 0.5 pm, a behaviour 
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compatible with only the three water-ice grain-size distributions we 
have simulated. 

Apart water ice, organic material grains (Fig. 3c) are the second end- 
member showing spectral similarities compatible with 67P particles: 
the 1-10 pm and 10-50 pm grain-size responses are in fact characterized 
by alinear decrease of /,,,, from 9= 60° to 120°, making them compat- 
ible with observations acquired during perihelion and post-perihelion 
periods. In contrast, submicrometre grains show an extreme red colour 
across the entire range of scattering angles: having A,,,, = 0.65 pm, they 
are marginally compatible only with VIRTIS observations taken at low 8 
angles. Ifone assumes that during one month, for example, during the 
duration of one MTP period, the dominant composition and grain size 
is preserved, then the VIRTIS measurements taken during MTPO28 at 
8>130° could be compatible not only with water ice but also with large 
(10-50 pm) organic material grains. As shown in Fig. 3a, amorphous 
carbon submicrometre-size (0.1-0.9 um) grains are characterized 
by a reddening of A,,,, for 9 < 90°, similar to VIRTIS but with absolute 
values too low to exactly match the observations. Instead, the match 
is compatible for 90° < 8 < 120°, making submicrometre amorphous 
carbon grains a good interloper between water-ice and organic grains. 
Finally, magnesium silicate (Fig. 3b) and troilite (Fig. 3d) submicrometre 
grains show ,,,, trends at low scattering angles not compatible with 
VIRTIS data. 

Composition and grain size distribution of the dust particles were 
further analysed through the 0.5-0.8 ppm spectral slope (Fig. 4) derived 
form VIRTIS data, which shows a peaked distribution at 0 = 100-110° 
(Fig. 4f) occurring in perihelion (MTPO20 and MTPO21) and post peri- 
helion (MTPO28) sequences. In our simulations, a similar behaviour is 
compatible with submicrometre-size grains made of organic matter 
(black points in Fig. 4e) or with greater than 1 pm water-ice grains (blue 
and red points in Fig. 4d). The absolute value of the slope on the peak 
(about 3 pm“) measured at perihelion (MTPO20, cyan points) is similar 
to theoretical values for organic matter. Being transparent, water-ice 
particles are characterized by a peculiar behaviour: submicrometre 
grains show a blue colour response (negative slope, black points in 
Fig. 4d) not compatible with any VIRTIS observation, whereas large 
grains (blue and red points) are neutral to moderately red and could 
contribute to the VIRTIS observed slope at any scattering angle. Opaque 
grains with sizes greater than 11m show an almost constant neutral to 
moderately red (0.5 4m) slope within the entire scattering angle range 
(see red and blue points for amorphous carbon in Fig. 4a and troilite 
in Fig. 4b). We remark that while the A,,,, trends of magnesium silicate 
(Fig. 3b) grains are not compatible with VIRTIS data, their spectral 
slopes values (Fig. 4c) for radii greater than 1 pm are remarkably similar 
for measurements taken during the post-perihelion period. 

Extended Data Table 3 shows the compatibility scheme of the /,,,, 
and 0.5-0.8 pm spectral slope values for the different composition 
endmembers, grain radius and scattering angle compared with the 
VIRTIS data. 


Data availability 


The VIRTIS calibrated data are publicly available through the European 
Space Agency’s Planetary Science Archive website (https://archives. 
esac.esa.int/psa/). 
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Extended Data Fig. 1| Example of atypical VIRTIS-M observation of 67P 
nucleus and coma. Top left: visible colours RGB = (0.7, 0.55, 0.44 1m) image, 
stretched to saturate the nucleus and enhance the visibility of jets inthe coma. 
Top right: tangent altitude image where the green mask corresponds tothe 
annulus containing all pixels witha tangent altitude betweenland2.5kmfrom 
the limb. Bottom left: average radiance spectra as derived from official 
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pipeline (black curve) and after correction with Vega data (red curve). The 
fourth-degree polynomial fit to the corrected radiance is shown (cyan curve). 
The retrieved maximum emission wavelength on the fit is indicated by the 
magenta dashed line. Bottom right: corresponding //F spectranormalized at 
0.5 um. The best-fitting slopes in the 0.4—-0.5 and 0.5-0.8 pm ranges are 
indicated by blue and green dashed lines, respectively. 
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Extended Data Fig. 2| Vega star signal and derived VIRTIS responsivity. Top 
left: average spectrum of Vega, in DN, from observation V1_00402035638.QUB 
(red curve) and spectrum corrected for dark current, despiked and filter notch 
removed (green curve). Note the correlation of negative spikes in the Vega and 
average sky spectrum (black curve). Owing to the instrumental point spread 
function and spectral tilt, the signal is an average taken from 18 pixels. Top 
right: average spectrum of Vega derived from the four observations listed in 
Extended Data Table 2 after having applied a processing similar to the one 
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shownin the previous plot. The curve is averaged with a two-point running 
boxcar filter. The Vega flux®’ is shown in relative units (blue curve). Bottom left: 
VIRTIS responsivity derived from Vega signal averaged witha two-point 
running boxcar filter (red curve) and nine points (blue curve) after 
normalization at 0.635 pm above the standard responsivity value. Bottom 
right: comparison between standard pipeline (black curve) and Vega 
responsivities with a nine-point running boxcar filter (red curve). The ratio 
between the two responses is the blue curve. 
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Extended Data Fig. 3| Rosetta spacecraft three-dimensional trajectory and 
solar phase angle variations with time. Top: Rosetta trajectory inthe 67P XYZ 
reference frame. Points along the X axis are shown starting from Rosetta’s 
position at 2015-01-13T23:28:53 (MTPO12) with an increment of lkmevery 

20 minto improve visualization. The Yaxis is oriented towards the Sun and the 


Phase (deg) 


t+10* 


MTP028 


~o 6°10" 
_.~-5°t0’ e 


120 


6°10" 
: 5°10" 
4°10 


310° 


X (km)+time offset 


210° 


Zaxis is perpendicular to the orbital plane. The red line indicates the position 
of the nucleus along the X axis. The integrated radiance as measured on each 
observation is reported according to the colour scale. Bottom: variation of the 
solar phase angle (Sun-nucleus centre-Rosetta) during the mission. 
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Extended Data Table 1| Rosetta’s calendar MTP periods dates and heliocentric distance 


End Time 


Heliocentric distance (AU) 


Period Start Time 
MTP012 | 2015-01-13T23:28:53 


MTP013 | 2015-02-10T23:23:53 


2015-02-10T23:23:53 
2015-03-10T23:23:53 


2.552-2.344 
2.344-2.130 


MTP014 | 2015-03-10T23:23:53 
MTP015 | 2015-04-08T 23:23:53 


2015-04-08T 23:23:53 
2015-05-05T 23:23:53 


2.130-1.967 
1.967-1.703 


MTP016 | 2015-05-05T23:23:53 
MTP017 | 2015-06-02T23:23:53 


2015-06-02T 23:23:53 
2015-06-30T 23:23:53 


1.703-1.508 
1.508-1.350 


MTP018 | 2015-06-30T23:23:53 
MTP019 | 2015-07-28T23:23:52 


2015-07-28T23:23:52 
2015-08-25T 23:23:52 


1.350-1.257 
1.257-1.253 


MTP020 | 2015-08-25123:23:52 
MTP021 | 2015-09-22T23:23:52 


2015-09-22T 23:23:52 
2015-10-20T23:23:52 


1.253-1.340 
1.340-1.494 


MTP022 | 2015-10-20T23:23:52 
MTP023 | 2015-11-171T23:23:52 


2015-1 1-171T23:23:52 
2015-12-15123:28:52 


1.494-1.686 
1.686-1.897 


MTP024 | 2015-12-15123:28:52 


2016-01-121T23:28:52 


1.897-2.113 


MTP025 | 2016-01-12T23:28:52 | 2016-02-09T23:28:52 2.113-2.327 
MTP026 | 2016-02-09T23:28:52 | 2016-03-08123:28:52 2.327-2.536 


2016-04-05123:28:52 


2.536-2.738 


MTP027 | 2016-03-08T23:28:52 
MTP028 | 2016-04-05T23:28:52 


2016-05-03T23:27:51 


2.738-2.933 


Extended Data Table 2 | List of VIRTIS observations of the Vega star 


_ 


V1_00402035638.QUB | 2015-09- 2015-09-28T | (432,256,29) 
281T04:35:17.781 | 05:04:11.405 
V1_00403369562.QUB | 2015-10- 2015-10-13T | (432,256,31) 
131T15:07:22.178 | 15:38:15.817 
V1_00403848118.QUB | 2015-10- 2015-10-19T | (432,256,31) 
19T04:03:18.373 | 04:34:12.013 
V1_00406049718.QUB | 2015-11- 2015-11-13T | (432,256,27) 
13T15:36:39.065 | 16:03:32.757 
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Extended Data Table 3 | Summary of spectral indicator compatibility 


Grain radius Scattering Angle (deg) 
(um) 

0.10-0.99 
Amorphous Carbon _/1.0-9.9 
10-50 
0.10-0.99 
Mg-silicate 1.0-9.9 
10-50 
0.10-0.99 
Organics 1.0-9.9 
10-50 
0.10-0.99 
Troilite 1.0-9.9 
10-50 
0.10-0.99 
Water Ice 1.0-9.9 
10-50 
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The table shows the compatibility of spectral indicators (A,,,,, 0.5-0.8 um spectral slopes) between VIRTIS observations and Mie scattering simulations for a given composition, grain radius 
range and scattering angle. The compatibility between observations and scattering simulations is colour coded according to the key shown on the top right. Each cell is divided in four fields 
showing A,,,and to 0.5-0.8 um slope values during pre-perihelion, perihelion and post-perihelion epochs (bottom right key). 
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The use of accelerated beams of electrons, protons or ions has furthered the 
development of nearly every scientific discipline. However, high-energy muon beams 
of equivalent quality have not yet been delivered. Muon beams can be created 


through the decay of pions produced by the interaction of a proton beam witha 
target. Such ‘tertiary’ beams have much lower brightness than those created by 
accelerating electrons, protons or ions. High-brightness muon beams comparable to 
those produced by state-of-the-art electron, proton and ion accelerators could 
facilitate the study of lepton—antilepton collisions at extremely high energies and 
provide well characterized neutrino beams’ ®. Such muon beams could be realized 
using ionization cooling, which has been proposed to increase muon-beam 
brightness”*. Here we report the realization of ionization cooling, which was 
confirmed by the observation of an increased number of low-amplitude muons after 
passage of the muon beam through an absorber, as well as an increase in the 
corresponding phase-space density. The simulated performance of the ionization 
cooling system is consistent with the measured data, validating designs of the 
ionization cooling channel in which the cooling process is repeated to producea 
substantial cooling effect? ". The results presented here are an important step 
towards achieving the muon-beam quality required to search for phenomena at 
energy scales beyond the reach of the Large Hadron Collider at a facility of equivalent 


or reduced footprint®. 


High-quality muon beams 

Fundamental insights into the structure of matter and the nature of its 
elementary constituents have been obtained using beams of charged 
particles. The use of time-varying electromagnetic fields to produce 
sustained acceleration was pioneered in the 1930s” “. Since then, high- 
energy and high-brightness particle accelerators have delivered elec- 
tron, proton and ion beams for applications ranging from the search 
for new phenomena in the interactions of quarks and leptons to the 
study of nuclear physics, materials science and biology. 

Muon beams can be created using a proton beam striking a target to 
produce a secondary beam comprising many particle species includ- 
ing pions, kaons and muons. The pions and kaons decay to produce 
additional muons, which are captured by electromagnetic beamline 
elements to produce a tertiary muon beam. Capture must be realized 
onatimescale compatible with the muon lifetime at rest, 2.2 us. Without 
acceleration, the energy and intensity of the muon beam is limited by 
the energy and intensity of the primary proton beam and the efficiency 
with which muons are captured. 

Accelerated high-brightness muon beams have been proposed 
as a source of neutrinos at neutrino factories and for the delivery of 
multi-TeV lepton-antilepton collisions at muon colliders’ °®. Muons 
have attractive properties for the delivery of high-energy collisions. 
The muon is a fundamental particle with mass 207 times that of the 
electron. This high mass results in suppression of synchrotron radia- 
tion, potentially enabling collisions between beams of muons and 


antimuons at energies far in excess of those that can be achieved in 
an electron-positron collider, such as the proposed International 
Linear Collider’, the Compact Linear Collider’®, the Circular Electron- 
Positron Collider” and the electron-positron option of the Future 
Circular Collider’’. The virtual absence of synchrotron radiation makes 
it possible to build a substantially smaller facility with the same or 
greater physics reach. 

The energy available in collisions between the constituent gluons 
and quarks in proton-proton collisions is considerably less than the 
energy of the proton beam because the colliding quarks and gluons 
each carry only a fraction of the proton’s momentum. Muons carry 
the full energy of the beam, making muon colliders attractive for the 
study of particle physics beyond the energy reach of facilities such as 
the Large Hadron Collider”. 

Most of the proposals for accelerated muon beams exploit the pro- 
ton-driven muon-beam production scheme outlined above and use 
beam cooling to increase the brightness of the tertiary muon beam 
before acceleration and storage to ensure sufficient luminosity or beam 
current. Four cooling techniques are in use at particle accelerators: 
synchrotron radiation cooling”, laser cooling”, stochastic cooling” 
and electron cooling”. In each case, the time required to cool the beam 
is long compared to the muon lifetime. Frictional cooling of muons, 
in which muons are electrostatically accelerated through an energy- 
absorbing medium at energies significantly below 1 MeV, has been 
demonstrated but with low efficiency. 


*A list of participants and their affiliations appears at the end of the paper. 
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Fig. 1| The MICE apparatus, the calculated magnetic field and the nominal 
horizontal width of the beam. The modelled field, B,, is shown onthe beam 
axis (black line) and at 160 mm from the axis (green line) in the horizontal plane. 
The readings of Hall probes situated at 160 mm from the beam axis are also 
shown. Vertical lines indicate the positions of the tracker stations (dashed 


The technique demonstrated in this study, ionization cooling”, is 
based ona suitably prepared beam passing through an appropriate 
material (the absorber) and losing momentum through ionization. 
Radio-frequency cavities restore momentum only along the beam direc- 
tion. Passing the muon beam through a repeating lattice of material and 
accelerators causes the ionization cooling effect to build up in atime 
much shorter than the muon lifetime’ “. Acceleration ofamuon beam 
in a radio-frequency accelerator has recently been demonstrated”’, 
and reduced beam heating, damped by the ionization cooling effect, 
has been observed”. Ionization cooling has not been demonstrated 
so far. Experimental validation of the technique is important for the 
development of muon accelerators. The international Muon loniza- 
tion Cooling Experiment (MICE; http://mice.iit.edu) was designed to 
demonstrate transverse ionization cooling, the realization of which 
is presented here. 

The brightness of a particle beam can be characterized by the num- 
ber of particles in the beam and the volume occupied by the beam in 
position-momentum phase space. The phase-space volume occupied 
by the beam and the phase-space density of the beam are conserved 
quantities in a conventional accelerator without cooling. The phase 
space considered here is the position and momentum transverse to 
the direction of travel of the beam, u = (x, p,, y, py), where x and y are 
coordinates perpendicular to the beam line, and p, and p, are the cor- 
responding components of the momentum. The zaxis is the nominal 
beam axis. 

The normalized root-mean-square (r.m.s.) emittance is convention- 
ally used as an indicator of the phase-space volume occupied by the 
beam’, but this quantity is not conserved when scraping or optical 
aberrations affect the edge of the beam. The distribution of ampli- 
tudes*°” is used here to study effects in the core of the beam. The 
amplitude of a particle is the distance of the particle from the beam 
centroid in normalized phase space, and is a conserved quantity in 
a conventional accelerator without cooling. The phase-space den- 
sity of the beam is also directly studied using a k-nearest-neighbour 
technique™. 


MICE cooling apparatus 


The MICE collaboration has built a tightly focusing solenoid lattice, 
absorbers and instrumentation to demonstrate the ionization cooling 
of muons. A schematic of the apparatus is shown in Fig. 1. 
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Downstream tracker EMR 


lines) and the absorber (dotted line). The nominal r.m.s. beam width, a(x), is 
calculated assuming a nominal input beam and using linear beam transport 
equations. See text for the description of the MICE apparatus. TOFO, TOFland 
TOF2are time-of-flight detector stations; KLis alead-scintillator pre-shower 
detector; EMRis the Electron—-Muon Ranger. 


A transfer line? * brought a beam, composed mostly of muons, 
froma target” in the ISIS synchrotron” to the cooling apparatus. The 
central momentum of the muons could be tuned between 140 MeV ct 
and 240 MeV c'(c, speed of light in vacuum). A variable-thickness brass 
and tungsten diffuser allowed the emittance of the incident beam to 
be varied between 4 mm and 10 mm. 

The tight focusing (low B function) and large acceptance required by 
the cooling section was achieved using 12 superconducting solenoids. 
The solenoids were contained in three warm-bore modules cooled by 
closed-cycle cryocoolers. The upstream and downstream modules 
(spectrometer solenoids) were identical, each containing three coils to 
provide a uniform field region of up to 4 T within the 400-mm-diameter 
warm bore for momentum measurement, as well as two ‘matching’ coils 
to match the beam to the central pair of closely spaced ‘focus’ coils, 
which focus the beam onto the absorber. The focus coils were designed 
to enable peak on-axis fields of up to 3.5 T within one module witha 
500-mm-diameter warm bore containing the absorbers. 

For the experiment reported here the focus coils were operated in 
‘flip’ mode with a field reversal at the centre. Because the magnetic 
lattice was tightly coupled, the cold mass-suspension systems of the 
modules were designed to withstand longitudinal cold-to-warm forces 
of several hundred kN, which could arise during an unbalanced quench 
of the system. At maximum field, the inter-coil force on the focus coil 
cold mass was of the order of 2 MN. The total energy stored inthe mag- 
netic system was of the order of 5 MJ and the system was protected 
by both active and passive quench-protection systems. The normal 
charging and discharging time of the solenoids was several hours. 
The entire magnetic channel was partially enclosed by a150-mm-thick 
soft-iron return yoke for external magnetic shielding. The magnetic 
fields inthe tracking volumes were monitored during operation using 
calibrated Hall probes. 

One of the matching coils in the downstream spectrometer solenoid 
was not operable owing to a failure of asuperconducting lead. Although 
this necessitated a compromise in the lattice optics and acceptance, 
the flexibility of the magnetic lattice was exploited to ensure a clear 
cooling measurement. 

The amplitude acceptance of approximately 30 mm, above which 
particles scrape, was large compared to that of a typical accelera- 
tor. Even so, considerable scraping was expected and observed for 
the highest-emittance beams. Ionization cooling cells with even 
larger acceptances, producing less scraping, have been designed? ™. 
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Fig. 2 | Beam distribution in phase space for the 6-140 Full LH2 setting of MICE. Measured beam distribution in the upstream tracker (above the diagonal) and inthe 
downstream tracker (below the diagonal). The measured coordinates of the particles are coloured according to the amplitude A, of the particle. 


The magnetic lattice of MICE, shown in Fig. 1, was tuned so that the 
focus of the beam was near the absorber, resulting ina small beam width 
and large angular divergence. The tight focusing, corresponding toa 
nominal transverse B function of around 430 mm at the centre of the 
absorber, yielded an optimal cooling performance. 

Materials with low atomic number, such as lithium and hydrogen, 
have a long radiation length relative to the rate of energy loss, and 
consequently better cooling performance, making them ideal absorber 
materials. Therefore, cooling by both liquid-hydrogen and lithium 
hydride absorbers was studied. 

The liquid hydrogen was contained within a 22- vessel** in the warm 
bore of the focus coil. Hydrogen was liquefied by a cryocooler and piped 
through the focus coil module into the absorber body. When filled, 
the absorber presented 349.6 + 0.2 mm of liquid hydrogen along the 
beam axis with a density of 0.07053 + 0.00008 g cm ° (all uncertain- 
ties represent the standard error). The liquid hydrogen was contained 
between a pair of aluminium windows covered by multi-layer insula- 
tion. Asecond pair of windows provided a secondary barrier to protect 
against failure of the primary containment windows. These windows 
were designed to be as thin as possible so that any scattering in them 
would not cause substantial heating. The total thickness of all four 
windows on the beam axis was 0.79 + 0.01mm. 

The lithium hydride absorber was a disk of thickness 65.37 + 0.02 mm 
witha density of 0.6957 + 0.0006 gcm*. The isotopic composition of 
the lithium used to produce the absorber was 95% °Li and 5% ’Li. The 
cylinder had a thin coating of parylene to prevent ingress of water or 
oxygen. Configurations with the empty liquid-hydrogen containment 
vessel and with no absorber were also studied. 


MICE beam instrumentation 


Detectors placed upstream and downstream of the apparatus meas- 
ured the momentum, position and species of each particle entering 
and leaving the cooling channel in order to reconstruct the full four- 
dimensional phase space, including the angular momentum intro- 
duced by the solenoids. Particles were recorded by the apparatus 


one at a time, which enabled high-precision instrumentation to be 
used and particles other than muons to be excluded from the analysis. 
Each ensemble of muons was accumulated over a number of hours. 
This is acceptable because space-charge effects are not expected ata 
neutrino factory and ina muon collider they become important only 
at very low longitudinal emittance*’. Data-taking periods for each 
absorber were separated by a period of weeks owing to operational 
practicalities. The phase-space distribution of the resulting ensemble 
was reconstructed using the upstream and downstream detectors. 
The emittance reconstruction in the upstream detector system is 
described in ref. *°. 

Upstream of the cooling apparatus, two time-of-flight (TOF) detec- 
tors*! measured the particle velocity. Acomplementary velocity meas- 
urement was made upstream by the threshold Cherenkov counters 
Ckov Aand Ckov B™. Scintillating fibre trackers, positioned in the uni- 
form-field region of each of the two spectrometer solenoids, measured 
the particle position and momentum upstream and downstream of the 
absorber***, Downstream, an additional TOF detector®, a mixed lead- 
scintillator pre-shower detector and a totally active scintillator calorim- 
eter, the Electron-Muon Ranger**”, identified electrons produced by 
muon decay and allowed cross-validation of the measurements made 
by the upstream detectors and the trackers. 

Eachtracker consisted of five planar scintillating-fibre stations. Each 
station comprised three views; each view was composed of two layers 
of 350-um-diameter scintillating fibres positioned at an angle of 120° 
with respect to the other views. The fibres were read out by cryogenic 
visible-light photon counters“. The position of a particle crossing the 
tracker was inferred from the coincidence of signals from the fibres, 
and the momentum was calculated by fitting a helical trajectory tothe 
signal positions, with appropriate consideration for energy loss and 
scattering in the fibres. 

Each TOF detector was constructed from two orthogonal planes 
of scintillator slabs. Photomultiplier tubes at each end of every TOF 
detector slab were used to determine the time at which a muon passed 
through the apparatus with a 60-ps resolution”. The momentum reso- 
lution of particles with a small helix radius in the tracker was improved 
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Fig. 3 |Muon amplitudes measured by MICE. The measured upstream 
distributions are shown by red circles while the downstream distributions are 
shown by greentriangles. Both upstream and downstream distributions are 
normalized to the bin with the most entries in the upstream distribution (see 
text). Coloured bands show the estimated standard error, whichis dominated 


by combining the TOF measurement of velocity with the measurement 
of momentum in the tracker. 

Adetailed Monte Carlo simulation of the experiment was performed 
to study the resolution and efficiency of the instrumentation and to 
determine the expected performance of the cooling apparatus*”°°. The 
simulation was found to give a good description of the data*®. 


Demonstration of cooling 


The data presented here were taken using beams with a nominal 
momentum of 140 MeV c ‘andanominal normalized r.m.s. emittance in 
the upstream tracking volume of 4 mm, 6 mm and 10 mm; these settings 
are denoted as ‘4-140’, ‘6-140’ and ‘10-140’, respectively. Beams witha 
higher emittance have more muons at high amplitudes and occupy a 
larger region in phase space. For each beam setting, two samples were 
considered for the analysis. The ‘upstream sample’ contained particles 
identified as muons by the upstream TOF detectors and tracker, for 
which the muontrajectory reconstructed in the upstream tracker was 
fully contained in the fiducial volume and for which the reconstructed 
momentum fell within the range 135 MeV c'to 145 MeV c? (which is 
considerably higher than the momentum resolution of the tracker, 
2 MeV c’). The ‘downstream sample’ was the subset of the upstream 
sample for which the reconstructed muons were fully contained inthe 
fiducial volume of the downstream tracker. Each of the samples had 
between 30,000 and 170,000 events. Examples of the phase-space 
distributions of the particles inthe two samples are shown in Fig. 2. The 
strong correlations between y and p, and between x and p, are due to 
the angular momentum introduced by the solenoidal field. The shorter 
tails along the semi-minor axis compared to the semi-major axis in 
these projections arise from scraping in the diffuser. 

The distributions of amplitudes in the upstream and downstream 
samples for each of the 4-140, 6-140 and 10-140 datasets are shown 
in Fig. 3. The nominal acceptance of the magnetic channel is also 
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by systematic uncertainties. Vertical lines indicate the approximate channel 
acceptance above which scraping occurs. The number of events in each sample 
is listed in Extended Data Table 2. Data for each experimental configuration 
were accumulated ina single discrete period. 


indicated. A correction has been made to account for the migration of 
events between amplitude bins due to the detector resolution and to 
account for inefficiency in the downstream detector system (see Meth- 
ods). Distributions are shown for the measurements with an empty 
liquid-hydrogen vessel (‘Empty LH,’), with a filled liquid-hydrogen 
vessel (‘Full LH,’), with no absorber (‘No absorber’) and with the lithium 
hydride absorber (‘LiH’). The distributions were normalized to allow 
acomparison of the shape of the distribution between different absorb- 
ers. Each pair of upstream and downstream amplitude distributions 
is scaled by 1/Ni,,,, where Ny, is the number of events in the most 
populated bin in the upstream sample. 

The behaviour of the beam at low amplitude is the key result of this 
study. For the ‘No absorber’ and ‘Empty LH,’ configurations, the num- 
ber of events with low amplitude in the downstream sample is similar 
to that observed in the upstream sample. For the 6-140 and 10-140 
configurations for both the ‘Full LH,’ and the ‘LiH’ samples, the number 
of events with low amplitude is considerably larger inthe downstream 
sample than in the upstream sample. This indicates an increase in the 
number of particles in the beam core when an absorber is installed, 
whichis expected if ionization cooling takes place. This effect can occur 
only because energy loss is anon-conservative process. 

A reduction in the number of muons at high amplitude is also 
observed, especially for the 10-140 setting. Whereas part of this effect 
arises owing to migration of muons into the beam core, a substantial 
number of high-amplitude particles outside the beam acceptance 
intersected the beam pipe or fell outside the fiducial volume of the 
downstream tracker. The beam pipe was made of materials with higher 
atomic number than those of the absorber materials, so interactions 
in the beam pipe tended to be dominated by multiple Coulomb scat- 
tering, leading to beam loss. 

AX’ test was performed to determine the confidence with which the 
null hypothesis that for the same input beam setting, the amplitude dis- 
tributions inthe downstream samples of the ‘Full LH,’ and ‘Empty LH,’ 
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Fig. 4| Downstream-to-upstream ratio ofnumber of events in MICE. A ratio 
greater than unity inthe beam core, whichis evidence of ionization cooling, is 
observed in the data obtained with the 6-140 and 10-140 beams with both the 
full LH, absorber and the LiH absorber. The effect predicted from the 
simulation is shown in red and that measured is shown in black. The 


configurations are compatible, and the amplitude distributions inthe 
downstream samples of the ‘LiH’ and ‘No absorber’ configurations are 
compatible. The test was performed onthe uncorrected distributions 
using only statistical uncertainties. Systematic effects are the same for 
the pairs of distributions tested, and cancel. Assuming that this null 
hypothesis is correct, the probability of observing the effect seen in 
the data is considerably lower than 10° for each beam setting and for 
each ‘Full LH,’-‘Empty LH,’ and ‘LiH’—‘No absorber’ pair; therefore, the 
null hypothesis was rejected. 

The fractional increase in the number of particles with low amplitude 
is most pronounced for the 10-140 beams. High-amplitude beams 
have high transverse emittance, €,, and a larger transverse momen- 
tum relative to the stochastic increase in transverse momentum due 
to scattering, so they undergo more cooling. For the magnet settings 
and beams studied here, heating due to multiple Coulomb scattering 
becomes dominant over ionization cooling at an emittance of around 
4mm. Asaresult, only modest cooling is observed for the 4-140 setting 
in both the ‘Full LH,’ and ‘LiH’ configurations. 

The ratios of the downstream to the upstream amplitude distribu- 
tions are shown in Fig. 4. In the ‘No absorber’ and ‘Empty absorber’ 
configurations, the ratios are consistent with 1 for amplitudes of less 
than 30 mm, confirming the conservation of amplitude in this region, 
irrespective of the incident beam. Above 30 mm the ratios drop below 
unity, indicating that at high amplitude there are fewer muons down- 
stream than upstream, as outlined above. The presence of the absorber 
windows does not strongly affect the amplitude distribution. For the 
6-140 and 10-140 datasets, the addition of liquid-hydrogen or lithium 
hydride absorber material causes the ratios to rise above unity for 
the low-amplitude particles that correspond to the beam core. This 
indicates an increase in the number of particles in the beam core and 
demonstrates ionization cooling. 

The density in phase space is an invariant of a symplectic system; 
therefore, an increase in phase-space density is also an unequivocal 


40 20 40 60 


corresponding shading shows the estimated standard error, whichis 
dominated by systematic uncertainty. Vertical lines indicate the channel 
acceptance above which scraping occurs. The number of events ineach sample 
is listed in Extended Data Table 2. Data for each experimental configuration 
were accumulated ina single discrete period. 


demonstration of cooling. Figure 5 shows the normalized density of 
the upstream and downstream samples, p,(u;)/p,, as a function of a, 
the fraction of the upstream sample that has a density greater than 
or equal to g;. This is known as the quantile distribution. To enable 
comparison between different beam configurations, the densities for 
each configuration have been normalized to the peak density in the 
upstream tracker, p,. To enable comparison between the upstream and 
downstream distributions, the fraction of the sample is always relative 
tothe total number of events inthe upstream sample. The transmission 
is the fraction of the beam for which the density in the downstream 
tracker reaches zero. For the ‘No absorber’ and ‘Empty LH,’ cases, the 
downstream density in the highest-density regions is indistinguishable 
from the upstream density. A small amount of scraping is observed for 
the 4-140 and 6-140 beams. More substantial scraping is observed for 
the 10-140 beam. In all cases, for ‘Full LH,’ and ‘LiH’ the phase-space 
density increases, and the increase is greater for higher-emittance 
beams. These observations demonstrate the ionization cooling of the 
beam when an absorber is installed. In the presence of an absorber, 
beams with larger nominal emittance show a greater increase in density 
than those with a lower nominal emittance. 


Conclusions 


lonization cooling has been unequivocally demonstrated. We have 
built and operated a section of a solenoidal cooling channel and dem- 
onstrated the ionization cooling of muons using both liquid hydrogen 
and lithium hydride absorbers. The effect has been observed through 
the measurement of both an increase inthe number of small-amplitude 
particles (Figs. 3, 4) and an increase in the phase-space density of the 
beam (Fig. 5). The results are well described by simulations (Fig. 4). This 
demonstration of ionization cooling is an important advance in the 
development of high-brightness muon beams. The seminal results pre- 
sented in this paper encourage further development of high-brightness 
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Fig. 5 | Normalized quantile distribution of the beam density in MICE. 
Upstream and downstream quantiles are indicated by orange and green lines, 
respectively, as a function of the fraction of the upstream sample. For each 
configuration, the density is normalized to the highest-density region inthe 


muon beamsasatool for the investigation of the fundamental proper- 
ties of matter. 
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Methods 


Characterization of beam brightness 

In particle accelerators, the average beam brightness B is defined as 
the beam current, /, passing through a transverse phase-space volume 
V, (ref. *) 


B=— (1) 


The normalized r.m.s. emittance is often used as an indicator of the 
phase-space volume occupied by the beam and is given by” 


_ AV 
éE= m,¢ (2) 


where m, isthe muon mass and |V| is the determinant of the covariance 
matrix of the beam in the transverse phase space u = (x, p,, y, py). The 
covariance matrix has elements Uy = (UjUl;) — (Uu)XU;). The distribution 
of individual particle amplitudes also describes the volume of the beam 
in phase space. 

The amplitude is defined by” 


A, =€,R7(u, (u)) (3) 


where R?(u, v) is the square of the distance between two points, uand 
v, inthe phase space, normalized to the covariance matrix: 


R2(u, v) = (u-v)'V-(u-v) (4) 


The normalized r.m.s. emittance is proportional to the mean of the 
particle amplitude distribution. In the approximation that particles 
travel near the beam axis, and in the absence of cooling, the particle 
amplitudes and the normalized r.m.s. emittance are conserved quan- 
tities. If the beam is well described by a multivariate Gaussian distri- 
bution, then R’ is distributed according to ay’ distribution with four 
degrees of freedom, so the amplitudes are distributed according to 
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The rate of change of the normalized transverse emittance as the 
beam passes through an absorber is given approximately by®”?*! 
dE, 
dz 


de, E, 


B,(13.6 MeV c 1)? 
RS + 
dz BE, 


2B°E,m,Xo 


where fc is the muon velocity, £, is the muon energy, |dE,/dz| is the 
mean energy loss per unit path length, X, is the radiation length of the 
absorber and £, is the transverse betatron function at the absorber”. 
The first term of this equation describes ‘cooling’ by ionization energy 
loss and the second term describes ‘heating’ by multiple Coulomb scat- 
tering. Equation (6) implies that there is an equilibrium emittance for 
which the emittance change is zero. 

Ifthe beam is well described by a multivariate Gaussian distribution 
both before and after cooling, then the downstream and upstream 
amplitude distributions f“(A ,) and “(A ,) are related to the downstream 


and upstream emittances e4 and 4 by 


2 
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In the experiment described in this paper, many particles do not 
travel near the beam axis. These particles experience effects from 


(6) 


optical aberrations, as well as geometrical effects such as scraping, 
in which high-amplitude particles outside the experiment’s aperture 
are removed from the beam. Scraping reduces the emittance of the 
ensemble and selectively removes those particles that scatter more than 
the rest of the ensemble. Optical aberrations and scraping introduce 
a bias in the change in r.m.s. emittance that occurs because of ioniza- 
tion cooling. In this work the distribution of amplitudes is studied. To 
expose the behaviour inthe beam core, independently of aberrations 
affecting the beam tail, Vand €, are recalculated for each amplitude 
bin, including particles that are in lower-amplitude bins and excluding 
particles that are in higher-amplitude bins. This results in a distribu- 
tion that, in the core of the beam, is independent of scraping effects 
and spherical aberrations. 

The change in phase-space density provides a direct measurement 
of the cooling effect. The k-nearest-neighbour algorithm provides a 
robust non-parametric estimator of the phase-space density of the 
muon ensemble****", The separation of pairs of muons is characterized 
by the normalized squared distance, Rj(u;, u,), between muons with 
positions u,and u,. A volume V, is associated with each particle, which 
corresponds to the hypersphere that is centred on u, and intersects 
the kth nearest particle (that is, the particle that has the Ath smallest 
R,). The density, p;, associated with the ith particle is estimated by 


a oe) 
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p,(u;) (8) 


where nis the number of particles in the ensemble. An optimal value 
for kisused, k=n*/4*” = . /n, with phase-space dimension d=4 (ref. ). 


Data taking and reconstruction 

Data were buffered in the front-end electronics and read out after each 
target actuation. Data storage was triggered by acoincidence of signals 
in the photomultiplier tubes (PMTs) serving a single scintillator slab 
in the upstream TOF station closest to the cooling channel (TOF1). 
The data recorded in response to a particular trigger are referred to 
as a ‘particle event’. 

Each TOF station was composed of a number of scintillator slabs 
that were read out using a pair of PMTs, one mounted at each end of 
each slab. The reconstruction of the data began with the search for 
coincidences in the signals from the two PMTs serving any one slab 
ina TOF plane. Such coincidences are referred to as ‘slab hits’. ‘Space 
points’ were then formed from the intersection of slab hits inthe x and 
y projections of each TOF station separately. The position and time at 
which a particle giving rise to the space point crossed the TOF station 
were then calculated using the slab position and the times measured in 
each of the PMTs. The relative timing of the two upstream TOF stations 
(TOFO and TOF1) was calibrated relative to the measured time taken for 
electrons to pass between the two TOF detectors, on the assumption 
that they travelled at the speed of light. 

Signals in the tracker readout were collected to reconstruct the 
helical trajectories (‘tracks’) of charged particles in the upstream and 
downstream trackers (TKU and TKD, respectively). Multiple Coulomb 
scattering introduced significant uncertainties in the reconstruction 
of the helical trajectory of tracks with a bending radius of less than 
5mm. For this class of track, the momentum was deduced by combin- 
ing the tracker measurement with the measurements from nearby 
detectors. The track-fitting quality was characterized by the y’ per 
degree of freedom 


8x? 
oF 


2 1 

Ka ae (9) 
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where 6x; is the distance between the fitted track and the measured 


signal in the ith tracker plane, o,is the resolution of the position meas- 
urement in the tracker planes and n is the number of planes that had 


a signal used in the track reconstruction. Further details of the recon- 
struction and simulation may be found in ref. °°. 


Beam selection 

Measurements made in the instrumentation upstream of the absorber 
were used to select the input beam. The input beam (the upstream 
sample) was composed of events that satisfied the following criteria: 
* Exactly one space point was found in TOFO and TOF1 and exactly one 
track in TKU. 

* The track in TKU had 4G: < 8 and was contained within the 150-mm 
fiducial radius over the full length of TKU. 

¢ The track in TKU had a reconstructed momentum in the range 135- 
145 MeV c!, corresponding to the momentum acceptance of the cool- 
ing cell. 

¢ The time-of-flight between TOFO and TOF1 was consistent with that 
of amuon, given the momentum measured in TKU. 

¢ The radius at which the track in TKU passed through the diffuser was 
smaller than the diffuser aperture. 

The beam emerging from the cooling cell (the downstream sample) 
was characterized using the subset of the upstream sample that satis- 
fied the following criteria: 
¢ Exactly one track was found in TKD. 
¢ The track in TKD had on < 8 and was contained within the 150-mm 
fiducial radius of TKD over the full length of the tracker. 

The same sample-selection criteria were used to select events from 
the simulation of the experiment, which included a reconstruction of 
the electronics signals expected for the simulated particles. 


Calculation of amplitudes 

The amplitude distributions obtained from the upstream and down- 
stream samples were corrected for the effects of the detector efficiency 
and resolution and for the migration of events between amplitude bins. 
The corrected number of events ina bin, N7°", was calculated from the 
raw number of events, N¥™, using 


end 


neo = ad SN” (10) 


where £,is the efficiency correction factor and S, accounts for the detec- 
tor resolution and event migration. £;and S, were estimated from the 
simulation of the experiment. The uncorrected and corrected ampli- 
tude distributions for a particular configuration are shown in Extended 
Data Fig. 1. The correction is small relative to the ionization cooling 
effect, which is clear even in the uncorrected distributions. 

Itcan be seen from equation (7) that in the limit of small amplitudes, 
and in the approximation that the beam is normally distributed in the 
phase-space variables, the ratio of the number of muons is equal to the 
ratio of the square of the emittances, 


(11) 
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The ratio of f to f" in the lowest-amplitude bin of Fig. 3, which is an 
approximation to this ratio, is listed in Extended Data Table 1. 


Data availability 


The unprocessed and reconstructed data that support the findings 
of this study are publicly available on the GridPP computing Grid at 
https://doi.org/10.17633/rd.brunel.3179644 (MICE unprocessed data) 
and https://doi.org/10.17633/rd.brunel.5955850 (MICE reconstructed 
data). Source data for Figs. 3-5 and Extended Data Fig. 1 are provided 
with the paper. 
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access to their data. Third-party results are not endorsed by the MICE 
collaboration.” 


Code availability 


The MAUS software” that was used to reconstruct and analyse the 
MICE data is available at https://doi.org/10.17633/rd.brunel.8337542. 
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Extended Data Fig. 1| Corrected and uncorrected amplitude distributions tothe upstream distribution and green triangles tothe downstream 
for the 10-140 ‘LH2 full’ configuration. The uncorrected data are shown by distribution. Shading represents the estimated total standard error. Error bars 
open points and the corrected data by filled points. Orange circles correspond show thestatistical error and for most points are smaller than the markers. 


Extended Data Table 1 | Ratio of number of muons downstream to number of muons upstream having an amplitude of less 
than5 mm 


4-140 6-140 10-140 

LH2 empty | 0.98 + 0.005 + 0.05 1.01 + 0.006 + 0.05 1.02 + 0.02 + 0.05 
LH2 full 1.12 + 0.009 + 0.05 1.16 + 0.009 40.05 1.32 + 0.02 + 0.07 
None 0.99 + 0.006 + 0.05 1.00 + 0.005 + 0.05 1.06 + 0.02 + 0.06 
Li 1.07 + 0.008 £0.05 1.14+40.01+0.05 1.23 + 0.03 + 0.07 


Uncertainties denote standard error; statistical uncertainty is followed by the total uncertainty. 
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Extended Data Table 2 | Number of events in the samples shown in Fig. 3-5 


4-140 6-140 10-140 
Upstream Downstream | Upstream Downstream | Upstream Downstream 
LH2 empty 163508 153813 158520 140981 123067 85082 
LH2 full 71823 67640 117383 107329 82371 62660 
None 91804 86877 172606 153809 54195 37436 
LiH 87514 82682 98443 89875 43423 32715 
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Precision spectroscopy of atomic systems’ is an invaluable tool for the study of 


fundamental interactions and symmetries’. Recently, highly charged ions have been 
proposed to enable sensitive tests of physics beyond the standard model” * and the 
realization of high-accuracy atomic clocks®», owing to their high sensitivity to 
fundamental physics and insensitivity to external perturbations, which result from 
the high binding energies of their outer electrons. However, the implementation of 
these ideas has been hindered by the low spectroscopic accuracies (of the order of 
parts per million) achieved so far® ®. Here we cool trapped, highly charged argon ions 
to the lowest temperature reported so far, and study them using coherent laser 
spectroscopy, achieving an increase in precision of eight orders of magnitude. We use 
quantum logic spectroscopy”” to probe the forbidden optical transition in *°Ar™ ata 
wavelength of 441 nanometres and measure its excited-state lifetime and g-factor. Our 
work unlocks the potential of highly charged ions as ubiquitous atomic systems for 
use in quantum information processing, as frequency standards and in highly 
sensitive tests of fundamental physics, such as searches for dark-matter candidates” 
or violations of fundamental symmetries’. 


Like a microscope aimed at the quantum world, laser spectroscopy 
pursues ever higher resolving power. Every increase in resolution ena- 
bles deeper insights into the subtle effects that all known fundamental 
interactions have on the atomic wavefunction. Advances in optical- 
frequency metrology have improved resolution drastically in the last 
three decades! and have made laser spectroscopy an extremely sensitive 
tool for studying open physics questions, suchas the nature of dark mat- 
ter, the strength of parity violation and a possible violation of Einstein’s 
theory of relativity”. However, only a few atomic and ionic species are 
currently within the reach of cutting-edge optical-frequency metrology. 
Expanding this field of exploration to systems with high sensitivity to 
such effects is therefore crucial. Owing to the very high binding ener- 
gies of their outer electrons, highly charged ions (HCIs) are promising 
candidates for such fundamental tests. The fractional contributions 
to the electronic transition energies from special relativity, quantum 
electrodynamics (QED) and the nucleus are several orders of magnitude 
larger than those in neutral atoms. This renders them ideal systems for 
benchmarking the most advanced theories and calculations, which has 
been repeatedly demonstrated via optical fluorescence spectroscopy in 
electron-beam ion traps (EBITs)*”, X-ray spectroscopy in storage rings” 
and EBITs®, and ground-state g-factor studies in Penning traps”. The 
hyperfine splitting of the 1s state in heavy hydrogen-like ions can even 
shift into the optical range, providing laser-accessible transitions (see, 
for example, refs. '*?°) with nuclear-size contributions of the order of 
several per cent of the total transition energy. 

It was realized recently that non-gravitational coupling of dark mat- 
ter to ordinary matter would affect atomic energy levels” and thus 


become observable in optical-clock comparisons as an apparent drift 
or modulation of the fine-structure constant a. HCls offer narrow- 
linewidth optical transitions that are among the most sensitive to a 
possible variation of a (ref. *). In addition, their inherent insensitivity 
to external electric fields’ leads to considerably smaller systematic 
perturbations compared to neutral and singly charged atoms. This 
makes them potentially superior references for high-accuracy optical 
atomic clocks, with many proposed species reviewed in ref. °. How- 
ever, so far no experiment has performed laser spectroscopy at the 
required level of precision. The major limitation was set by the high 
temperature of a few million kelvins at which HCls are produced and 
typically stored. This induces Doppler broadenings with full-width- 
at-half-maximum (FWHM) linewidths of several tens of gigahertz 
and corresponding line-centre uncertainties of a few hundreds of 
megahertz in the best cases° ®. Because HCls generally do not offer 
suitable transitions for direct laser cooling, sympathetic cooling of 
multiple HCls by laser-cooled *Be* ions was implemented in a Pen- 
ning trap at the Lawrence Livermore National Laboratory”, reaching 
anion temperature of around 4 K. More recently, the Cryogenic Paul 
Trap Experiment” demonstrated reliable Coulomb crystallization of 
single *°Ar’** ions ina crystal of many Be’ ions. Sympathetic Doppler 
cooling down to the 10-mK level and two-ion crystal preparation?>”* 
paved the way for high-accuracy spectroscopy. Even so, spectroscopy 
of narrow transitions in single ions requires efficient state detection 
ona different, fast-cycling transition’, whichis typically also used for 
laser cooling. If sucha transition is not available, quantum logic spec- 
troscopy (QLS) canbe employed’”’. In QLS, the ‘spectroscopy ion’ (in 
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Fig. 1| Time sequence of HCI recapture and two-ion crystal preparation. a, A 
laser-cooled Coulomb crystal of 50-100 fluorescing °Be* ions is confined inthe 


Paul trap. b, A single Ar'* ionis injected along the crystal axis, sympathetically 


cooled and finally co-crystallized with °Be’*. It appears as a large dark void 
owing to the repulsion of the “Be* by the high charge state. c, Excess °Be* ions 
are removed by modulating the Paul trap radio-frequency potential inthe 
absence of laser cooling, resulting in heating and ion losses. d, Finally, the 
Ar*—°Be* two-ion crystal is prepared. 


this case the HCI) is co-trapped with a so-called ‘logic ion’ (?Be*) that 
provides sympathetic cooling and state preparation and is used for 
state detection. These functions are enabled by the Coulomb inter- 
action between the two ions, allowing lasers to couple their internal 
electronic levels with their quantised joint motion in the trap. This 
technique and variations thereof have been successfully employed for 
optical atomic clocks based on Al’ ions”, for internal state detection 
and spectroscopy of molecular ions”*”’ and for spectroscopy of broad 
transitions in atomicions**™. 

Here, we demonstrate QLS of an HCI-specifically, of the electric- 
dipole-forbidden transition between the *P,,. and ?P;,, fine-structure 
levels of *°Ar’** at a wavelength of 441 nm, the most accurately known 
transition in any HCI°. We achieve an FWHM well below 100 Hz, close to 
the natural linewidth of 17 Hz. Single line scans taken onatimescale ofa 
few minutes determine the line centre with an uncertainty of less than 
2Hz. This corresponds to a fractional statistical uncertainty of 3 x10 
for a transition frequency of approximately 680 THz and compares 
favourably to previous measurements taken over hours or even days, 
which achieved relative uncertainties® § of 2 x 10’. Quantum logic- 
assisted state preparation of the ‘°Ar’ ion allows us to measure all six 
Zeeman components of the transition, which split up on a megahertz 
scale in a 160-yT magnetic quantization field. This allows us to deter- 
mine theg-factor of the P3,. excited state with unprecedented accuracy. 
Furthermore, we demonstrate a quantum logic-assisted excited-state 
lifetime measurement. 


Preparation ofa single HCI 
A detailed description of the experimental setup is given in Methods 
(see also Extended Data Figs. 1, 2). In brief, argon HCls are produced by 
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Fig. 2| Schematic illustration of the experimental cycle. a, Level diagrams 
(not to scale) of boron-like Ar" and °Be*. The motional Fock state of the crystal 
is denoted as|n),,. Solid and dotted black lines indicate the corresponding 
ground and first excited motional states, respectively. b, Experimental 
sequence. After Doppler cooling on the °Be*|¥ ),—’P3,. transition followed by 
ground-state cooling to|0),,, by stimulated Raman transitions, the internal 
state of Ar'** is prepared (see also Fig. 4c and Extended Data Fig. 3). Aclock laser 
pulse addressing the carrier of the Ar’ transition is then applied (1). After an 
optional wait time for lifetime measurements, a clock laser 11-pulse on the Ar’>* 
red sideband (RSB) maps the Ar’* electronic state onto the common motional 
state (2). Ared-sideband 1-pulse on the °Be* hyperfine transition|¥),-|*), maps 
it onto the °Be’ electronic state (3). Finally, this state is detected using the 
Doppler cooling laser (4). 


anEBIT, PTB-EBIT™, and ejected from it in triggered bunches of -200 ns 
duration with a mean kinetic energy of approximately 700q V, where q 
isthe ion charge. The HCls are guided to the spectroscopy trap through 
anion optical beamline. Based ontheir time of flight, we select the °Ar* 
ions by rapidly switching a gate electrode. A pulsed gradient potential 
decelerates them electrodynamically® to about 146q V. Then, asingle 
ar ion stochastically enters the cryogenic linear Paul trap**. The trap 
is globally biased to +138 V, thereby slowing the HCI down to 8g Vupon 
entry. After passing through the trapping region, the HCl is reflected 
back by an electrode at the end of the Paul trap. A mirror electrode 
in front of the trap is switched up to prevent the ion from escaping 
again, thereby capturing the HCI in an oscillatory axial motion. The 
repeated crossing through a pre-prepared laser-cooled °Be* Coulomb 
crystal within the trap dissipates the residual kinetic energy of the HCI. 
After sufficient sympathetic cooling, the Ar ionjoins the Coulomb 
crystal. Excess °Be* ions are removed until a two-ion crystal has been 
prepared (see Fig. 1). The entire preparation procedure of the two-ion 
crystal takes only a few minutes. The Paul trap is refrigerated to less 
than5K byamechanically decoupled, closed-cycle cryostat to provide 
a vacuum below 10°” Pa (corresponding to a particle density of less 
than 20,000 cm*°), thus suppressing charge-exchange collisions and 
achieving HCI storage times® of the order of 45 min. 


Ground-state cooling and quantum logic 

The implementation of QLS requires control and preparation of the 
motional and internal states of both ions using coherent laser pulses 
oncarrier and sideband transitions. After the two-ion crystal prepara- 
tion, the strong Coulomb coupling between the two ions results in joint 
motional modes within the trap. Sympathetic cooling, state preparation 
and QLS are performed by repeating the experimental sequence shown 
in Fig. 2. First, Doppler cooling and optical pumping on the *Be*’S,.- 
*P,,. cycling transition (see Fig. 2a) are applied. The two axial normal 


Nature | Vol578 | 6 February 2020 | 61 


Article 


a b 07- c 

£ 2 2 

5 5 5 

oO oO o 

ne} 

: 3 3 

e a a 

2 5 5 

£ sg g 

5 # # 

x oO oO 

Ww 5 in 
0.0 0.0 ! | | 0.0 ! | | | | | | | 
-200 -100 0 100 200 0 5 10 15 0 5 10 15 20 25 30 35 40 


Detuning (Hz) 


Fig. 3|Rabispectroscopy and excited-state lifetime measurement. a, Clock 
laser frequency scan across Zeeman component (see Fig. 4c) of the *°Ar* 
fine-structure transition. The fixed probe time of 12 ms is longer than the 
excited-state lifetime of 9.6 ms. The line is fitted by a Rabi line shape (blue 
curve), reaching a Fourier-limited FWHM of about 65 Hz. b, On-resonance 
coherent excitation of this transition. The coherent state Rabi flopping signal 
(fitted by the red curve, which represents a damped sine with offset) exhibits a 
2.2-ms 1t-time (in which the maximum transferable population is transferred to 
the excited state) and decays exponentially with the excited-state lifetime (red- 


modes of the Coulomb crystal with secular frequencies of about 
Vip = 1.37 MHz (in phase) and Vop = 1.86 MHz (out of phase) are then 
cooled to the quantum mechanical ground state of motion with final 
average occupation numbers of 7 = 0.05 (in phase) and 7 = 0.02 (out 
of phase), corresponding to an effective temperature of less than 50 pK 
for each mode. For this purpose, we use laser pulses that coherently 
couple the electronic degrees of freedom to the common motional 
modes (a technique referred to as resolved sideband cooling). To do 
this, stimulated Raman transitions are driven between the Be’ hyper- 
fine qubit states|\ ), |), > |*),|2-1),{where|n),,denotes the motional 
quantum state of the in-phase or out-of-phase modes and |*), refers 
to the upper qubit state of the logic ion °Be*) using two laser beams 
with a wavelength of 313 nm. In our low magnetic field, the states are 
separated by a frequency of approximately 1.25 GHz. A repumping laser 
couples the *Be’’S,,,. and ’P,,. levels (not shownin Fig. 2a) for electronic- 
state preparation and for depopulation of state|*), (ref. °°). The Ar 
Zeeman ground state is then deterministically prepared with clock 
laser sideband pulses (see Extended Data Fig. 3). After full state prepa- 
ration, QLS’ is performed in four steps (see Fig. 2): (1) aclock laser pulse 
of tuneable length and power is applied, which couples the ground and 
excited states in Ar’* coherently. (2) After a variable wait time for 
excited-state lifetime measurements, a clock laser red-sideband t-pulse 
maps the excitation from the electronic Ar’ state onto the common 
axial out-of-phase mode and (3) another red-sideband t-pulse on the 
°Be* hyperfine qubit transition maps it onto the °Be’ electronic state. 
(4) Finally, the qubit state of Be* (dark,|*),, or bright,|\ ),) is detected 
with a fidelity of up to 98% by counting the fluorescence photons that 
it scatters from the Doppler cooling laser within 200 ps. A threshold 
value discriminates between the two states. The sequence is carried 
out multiple times (about 100) within a few seconds witha fixed set of 
parameters to average the quantum projection noise and evaluate a 
mean excitation probability. To resolve linewidths approaching the 
natural linewidth of 17 Hz, a narrow-linewidth clock laser is required. 
Our home-built laser system is composed of a commercial extended- 
cavity diode laser (ECDL) at 882 nm, whichis prestabilized with a high 
locking bandwidth of 4 MHztoa passive external reference cavity using 
the ECDL pump current and grating piezo as actuators for the feedback. 
Thereby, we suppress laser high-frequency noise and obtain an instan- 
taneous linewidth of about 2 kHz, whichis limited by the relatively low 
cavity finesse (of about 1,000) and lack of vibration-insensitive design. 
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shaded envelope). The error bars inaand b represent the quantum projection 
noise of 255 measurements per data point. c, Excited-state lifetime 
measurement. Quantum logic sequences (see text) are carried out asa function 
of the wait time between carrier and red-sideband clock laser pulses. During 
the wait time, the excited state can decay spontaneously. Fromathree- 
parameter maximum-likelihood estimation, we obtain a lifetime of 9.97(27) ms, 
limited by the quantum projection noise of 1,100 measurements per data point 
(error bars). The black curve and grey-shaded area show the estimated 
exponential decay with the corresponding louncertainty band. 


To suppress the residual noise, the laser is then further stabilized by 
phase-locking it to an ultrastable laser operating at a wavelength of 
1.5 um. The latter is itself stabilized to a cryogenic cavity made from 
crystalline silicon (referred to as Si2)””. This achieves a fractional fre- 
quency instability at the thermal noise limit of the cavity of 4 x10” at 
averaging times of 1-50 s. Using a femtosecond optical frequency comb 
as atransfer oscillator, we generate a virtual beatnote between the two 
lasers**, By demodulating it, we register their relative frequency and 
phase fluctuations, which are dominated by the substantially higher 
noise level of the 882-nm prestabilization cavity. The demodulated 
beatnote is used to generate a feedback signal for phase-locking the 
two lasers, which is applied to an acousto-optic modulator between 
the ECDL and the prestabilization cavity. The considerably lower band- 
width of the second locking stage ensures that the two loops do not 
compete with one another, but drifts and noise on the prestabilization 
cavity of upto kilohertz level are suppressed at the ECDL output, from 
which the spectroscopy light is derived. This suppresses the residual 
noise of the 882-nm clock laser, narrows its linewidth and reduces the 
daily drift to a level of about 10 Hz, which is dictated by Si2. The laser 
is frequency-doubled to 441 nm in an external enhancement cavity 
containing a periodically poled potassium titanyl phosphate crystal. 
Active power stabilization on a pulse-by-pulse basis at the ion trap is 
implemented for the clock, Doppler cooling and Raman lasers to 
achieve stable system parameters such as Rabi frequencies and a.c. 
Stark shifts. 

Coherent laser spectroscopy of *°Ar* 

By applying this technique, we carried out the first coherent laser spec- 
troscopy of an HCI. Figure 3a shows the excitation profile of the 
My = -1/2 to M3 = -3/2 Zeeman component of the *°Ar®*?P,,>—?P3/> 
fine-structure transition. The blue curve shows a fit to the line by a Rabi 
line shape, as expected for the top-hat laser pulse of 12-ms duration. 
This pulse length results in a Fourier-limited linewidth of 65 Hz FWHM. 
We do not observe any additional line broadenings on this first-order 
Zeeman sensitive transition at this level, which confirms our previous 
measurements* of a magnetic-field stability better than 1nT achieved 
via active stabilization of the field in the vicinity of the vacuum cham- 
ber. Additionally, alternating external magnetic fields are shielded by 
the highly conductive cryogenic thermal shields made of high-purity 
copper with a low-pass corner frequency of <0.3 Hz and suppression 
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Fig. 4|Zeeman structure of the *°Ar*"?P, .’P,,. fine-structure transition. 
a, Excitation probability as a function of the clock laser detuning from the 
degenerate line centre, showing the six individual Zeeman components. 
Comparable laser-ion couplings (1t-times of 5-6 ms) were chosen for each 
component, corresponding to Fourier-limited linewidths of around 150 Hz. 
The solid curves are Rabi line-shape fitting functions used to determine the 
centre frequencies. Error bars represent the quantum projection noise of 200 
repetitions. The varying excitation probabilities of the six components are 
caused by slightly different state preparation efficiencies. b, Reconstructed 


of 30-40 dB inthe frequency range 60 Hz-1 kHz (ref. **). The maximum 
fringe contrast of about 0.4 at this probe duration was mostly limited 
by the excited-state lifetime, with contributions from the ~90% fidel- 
ity of the sideband operations on the two ions, as well as from imper- 
fect state preparation and detection. Frequency scans with longer 
probe times can in principle resolve the natural linewidth, albeit at a 
reduced excitation probability. Figure 3b shows the on-resonance 
excitation probability as a function of the probe time for a higher inten- 
sity of the clock laser. Under continuous illumination, Rabi flopping 
between the two electronic states is observed (fitted by the red curve). 
The coherence decays with the known excited-state lifetime’ of 
9.573( i )stat ( i )syst MS(where the statistical and systematic standard 
deviations are given in parentheses), indicated by the red-shaded expo- 
nential envelope of the fit. This measurement confirms coherence 
beyond this timescale for both the clock laser and the magnetic field. 


-1/2 


Zeeman shifts of the ?P;,. substates (upper panel) and their residuals (lower 
panel) with respect toa linear fit (solid black line). Note the different vertical 
scales. For each of the substates m,=¥1/2, two data points are obtained from 
transitions 2, 3 and 4, 5, respectively. A magnetic-field instability of about 
0.5nT contributes to the standard uncertainties of the line centres, which 
become larger for the outer components. The slope of the linear fit is 
proportional to the ratio of the g-factors of the excited and ground states. 

c, Level diagram of the?P,,.andP3,. Zeeman substates and corresponding 
Zeeman components of the fine-structure transition. 


We also performed a direct measurement of the excited-state lifetime. 
For this, a carrier 1-pulse (step (1) in Fig. 2) with maximum laser inten- 
sity was applied, which populated the *°Ar’** excited state in about 
16 ps. After a variable wait time, the full transfer sequence was per- 
formed, and the remaining *°Ar’ excited-state fraction was mapped 
onto the °Be* qubit state (see also Methods). During the wait time, a 
series of ground-state-cooling pulses on both axial motional modes 
was applied every millisecond to keep the two-ion crystal in the 
motional ground state in the presence of anomalous heating of 12 and 
29 phonons per second for the out-of-phase and in-phase modes, 
respectively. By incrementing the wait time in 1-ms steps, an axial mode 
temperature independent of the wait time was ensured. The observed 
exponential spontaneous decay of the excited state is shown in Fig. 3c 
and results ina lifetime of 9.97(27) ms. This is about 1.5 standard devi- 
ations longer than the more accurate experimental result of 
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Fig. 5 | Comparison of calculated (red) and measured (blue) excited-stateg- 
factors. Shownare results from Glazov et al.*? (i), Verdebout et al.* (ii), Marques 
etal.* (iii), Sachepetnovetal.** (iv), Agababaev et al. (v) and Maison etal.” (vi). The 
error bars of (iii) and (v) are smaller than the data points. Nouncertainty is provided 
for data point (ii). (1), (II) and (III) represent g-factors evaluated from the three 
datasets produced inthis work with their standard uncertainty. The solid blueline 
displays the weighted average with the louncertainty band, with the largest 
contribution coming from the systematic uncertainty. See Methods for details. 


9.573( ee ae )syst Ms obtained from an in-EBIT measurement” and 
than advanced calculations of 9.538(2) ms (ref. *°) and 9.5354(20) ms 
(ref. *1). Our result is consistent with the previous measurement and 
calculations within the uncertainty. Further details are provided in 
Methods. 


Measurement of the excited-state g-factor 

Amagnetic field of about 160 p'T is applied at the location of the ions to 
define a quantization axis and to deliberately split the Zeeman substates 
of *Be* and *°Ar'** onthe megahertz scale. With quantum logic-assisted 
HCI state preparation (see Fig. 2b and Extended Data Fig. 3), all six Zee- 
man components of the *°Ar®*?P,.-P;,. transition can be coherently 
excited, as shown in Fig. 4a. The bottom horizontal axes represent 
the clock laser detuning from the degenerate line centre, and the top 
horizontal axes represent the relative detunings from the centres of the 
individual Zeeman components. We can reconstruct the Zeeman shifts 
of the ?P,,. substates from these data (see Fig. 4b, cand Methods) and 
derive the ratio of the g-factors of the excited and ground states from 
the measured frequencies. Within our current experimental precision 
of a few hertz over a splitting of several megahertz, we do not observe 
any quadratic contribution, which for instance could arise from cou- 
pling of electric field gradients to the electric quadrupole moment of 
the’P,,. state, or froma quadratic Zeeman shift. A quadratic term added 
to the fitting function of Fig. 4b is consistent with zero. Recently, the 
ground-state g-factor g,,. of *°Ar’** was measured in the Penning trap 
experiment ALPHATRAP with an accuracy of parts per billion using 
the continuous Stern-Gerlach method”. Using this value, we obtaina 
weighted average Of 83. = 1.3322895(13) ta(56) sys from three individual 
measurements (see Fig. 5). This is an improvement of more than two 
orders of magnitude over previous in-EBIT measurements’, revealing 
the contributions that arise from special relativity, interelectronic 
interactions and QED to an HCl excited-state g-factor. It also settles a 
discrepancy between previous theoretical values” “’, confirming the 
configuration-interaction calculations of refs. *7*>** and very recent 
coupled-cluster calculations”. 
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Conclusions 

We have cooled HCIs to the ground state of motion in a linear Paul 
trap, making them the coldest HCls prepared in a laboratory so far. 
This enabled us to perform coherent, optical-clock-like laser spectros- 
copy of an electric-dipole-forbidden optical transition in an HCI using 
quantum logic, ata level of precision that is eight orders of magnitude 
higher than the previous state of the art. This proves the feasibility of 
hertz-level optical spectroscopy of HCls and opens up this large class 
of atomic systems to the tools of cutting-edge frequency metrology 
and quantum information processing. 

The determination of the absolute frequency of the Ar fine-struc- 
ture transition witha fractional uncertainty of 3 x 10» and even higher 
levels of precision requires further evaluation of systematic shifts, such 
as the small time dilation shift from the residual motion of the ion“ or 
the electric quadrupole shift’, which is typically suppressed in HCls. 
By restricting measurements to the points of maximum frequency 
sensitivity of each line, frequency information can be obtained faster 
than when scanning the full line profiles, as demonstrated here, further 
reducing the statistical uncertainty at a given averaging time™. At the 
same time, averaging over the Zeeman components onsecond-—rather 
than minute—timescales will suppress systematic uncertainties arising 
from drifting magnetic fields”. 

The presented techniques are not limited to our proof-of-principle 
HCI, *°Ar’**, but can be applied more generally to forbidden transi- 
tions in other HCls. Several of the candidate species have properties 
that are even better suited for optical-clock experiments, including 
much longer excited-state lifetimes and suppressed systematic shifts. 
Certain HCls are particularly sensitive to physics beyond the standard 
model, such as possible variations of the fine-structure constant*, or 
to effects arising from fundamental interactions. Particularly, HClIs 
allow the systematic study of relativistic effects in bound electronic 
systems and of bound-state QED along isoelectronic sequences at 
ultrahigh precision®. 

Furthermore, the techniques that we have demonstrated here are 
not limited to the optical domain. Our work also unlocks the new 
frontiers of the vacuum ultraviolet and X-ray regimes for ultrahigh 
precision spectroscopy—regions of the electromagnetic spectrum that 
are incompatible with neutral and singly charged atoms owing to una- 
voidable photoionization. This will enable novel high-accuracy atomic 
clocks based on HCls and unrivalled tests of fundamental physics. 

We note that during the revision of the manuscript, a complemen- 
tary work demonstrating incoherent laser spectroscopy of *°Ar’** ina 
Penning trap was published”. 
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Methods 


HCI production, transfer and recapture 

We show atop view of the laboratory setup in Extended Data Fig. land 
a simplified schematic of the potential landscape in Extended Data 
Fig. 2a. HCls are produced by electron impact ionization and stored 
by PTB-EBIT, a Heidelberg-type compact EBIT™. After extraction of 
ions in bunches, a beamline with multiple electrostatic elements is 
used to guide the ions towards the Paul trap and to manipulate their 
kinetic-energy distribution. Five segmented einzel lenses’ and an 
electrostatic double-focusing 90° deflector“ are employed for focus- 
ing and steering. A pair of pulsed drift tubes (following the approach 
described in ref. *) is used for deceleration and pre-cooling, reducing 
the phase-space volume of the bunches. Downstream, two microchan- 
nel plate (MCP) detectors can be moved into the ion beam in front of 
and behind the Paul trap to optimize ion yield and beam transmission. 
The first MCP detector also features a retarding field analyser that is 
used to determine the mean kinetic energy and the energy spread of 
the ion bunches. Although this method of HCI production, transfer 
and recapture combining the EBIT, beamline and Paul trap can handle 
a large variety of elements and charge states, the following section 
refers specifically to the present case of optimized *°Ar’* recapture. 

Highly charged argon ions are produced ina distribution of charge 
states using a 13-mA, 1-keV electron beam in the approximately 
50-V-deep axial trapping potential of the EBIT. In each cycle, the cen- 
tral trap electrode is rapidly switched from about 450 V (for the afore- 
mentioned 50-V-deep trap) toa repulsive extraction potential of 700 V 
(200 V higher than the outer trap electrode potential) for a period of 
200 ns at a rate of 4 Hz to eject the ions. The kinetic energy relative to 
the ground potential of the beamline (0 V), and thus the velocity of 
the extracted ions, depend on the total extraction potential of 700 V 
and on the ionic charge g, allowing separation of the different charge 
states by their different times of flight (Extended Data Fig. 2a, b). *°Ar* 
is selected with the help of an electrode of the third segmented einzel 
lens immediately behind the 90° deflector. This electrode acts as a gate 
by rapidly switching to a passing voltage at the *°Ar™ arrival time, and 
back toa deflecting voltage after the ion passage. Thus, the trajectories 
of all other charge states are deflected away from the Paul trap. We 
measure a mean kinetic energy of 694q V with respect to ground for 
the fast *°Ar’** bunches using the retarding field analyser (Extended 
Data Fig. 2e). An associated axial energy spread of 32q V was also deter- 
mined. To decrease the mean kinetic energy and its spread to values 
more amenable for trapping and efficient cooling in the cryogenic 
Paul trap, we perform an electrodynamic deceleration step with the 
pair of pulsed drift tubes. By biasing them to approximately 510 Vand 
590 V before the extraction, a linear axial potential gradient is gener- 
ated on the beamline axis between the two electrodes. Thus, when 
the ion bunch arrives at that position, about 9.7 ps after ion ejection, 
it is exposed to a mean potential of 550 V. Then, both drift tube poten- 
tials are rapidly grounded using a fast high-voltage switch. This slows 
down theion bunch toa kinetic energy of 146g V and reduces the axial 
energy spread to 13q V (Extended Data Fig. 2f). The deceleration step 
also shortens the length of the ion bunches considerably, from about 
5.2cmto about 1.7 cm FWHM, while their temporal width is only slightly 
reduced (Extended Data Fig. 2c, d). After passing through a final einzel 
lens and an unbiased mirror tube, the *°Ar* ions enter the Paul trap. 
The trap voltages are commonly biased to 138 V to accomplish the 
final electrostatic deceleration step. This brings the ions to a residual 
kinetic energy of about 5g Vto 10q V. 

The Paul trap is formed by a radially confining radio-frequency poten- 
tial and an axially confining d.c. potential. Once inside the trap, the HCls 
repeatedly pass through a cigar-shaped Coulomb crystal composed 
of about 50 to 100 *Be’ ions that has been previously loaded into the 
Paul trap using laser ablation combined with photoionization™ (see 
also Fig. 1). This proceeds as follows. During injection into the Paul 


trap, owing to their relatively high kinetic energy, most HCIs can over- 
come the weak axially confining potential of 300 mV (above the biased 
ground of 138 V) applied to the electrostatic endcap at the entrance of 
the trap. After passing through the ’Be* Coulomb crystal for the first 
time, the *°Ar'* ions are reflected by the opposite electrostatic endcap 
potential of about 12 V (above the biased ground of 138 V). Inthe mean- 
time (17.1 ps after initial ion extraction from the EBIT), the mirror tube 
at the entrance of the Paul trap is rapidly switched up to a confining 
axial electrostatic potential to complete the capture of *°Ar™. Then the 
trap remains closed for 1.9 s, during which the HCls can dissipate their 
residual kinetic energy by repeated interactions with the laser-cooled 
°Be’ ions. If these steps are successful for an *°Ar’** ion, it joins the °Be* 
Coulomb crystal (Fig. 1). Otherwise, the mirror-tube potential is lowered 
again to let the next HCI bunch enter the Paul trap. This whole recapture 
process is rather efficient and succeeds in less than 30 s on average. 


Excited-state lifetime measurement 

The data for the lifetime measurement were acquired from 440 meas- 
urements, each of which includes 100 experimental realizations, adding 
up to a total measurement time of about two hours. Eleven measure- 
ments were averaged for every single wait time, with the error bars 
in Fig. 3c indicating the quantum projection noise of 1,100 experi- 
mental implementations. To cancel the effects of parameter drifts on 
the observed signal, the wait time was scanned in a pseudo-random 
sequence. 

Drifts of the atomic resonance frequencies could lead to systematic 
variations inthe detected excitation probabilities. The shortest achiev- 
able 1t-times, 16 1s for the initial HCI excitation and 225 ps for the HCI 
sideband transition, lead to an interaction broadening of the respec- 
tive lines of 62 kHz and 4.4 kHz, respectively. Our typical short-term 
magnetic-field fluctuations lead to line shifts of <10 Hz level, and thus 
affect the measured excited-state population of the order of 10*. The 
axial trap frequency has fluctuations below 100 Hz over the course of 
a day. The distribution of data points for a given wait time is consist- 
ent with the expected quantum projection noise, thereby ruling out 
systematic drifts at the level of the statistical uncertainty. 

The clock laser pulses are generated by the first diffraction order of 
an acousto-optic modulator (AOM). Despite the typical 100-dB level of 
extinction of the radio-frequency drive power provided by an active 
radio-frequency switch, the optical extinction ratio does not reach 
this level owing to scattered light within the AOM crystal. However, 
this leaked light is unshifted by the AOM and therefore detuned from 
the ion resonance by the radio-frequency drive frequency of about 
200 MHz, or 10’ natural linewidths. This alone reduces the de-excitation 
probability by approximately 14 orders of magnitude. 

Spontaneous decay of the HCI onthe red sideband is suppressed as 
the square of the Lamb-Dicke parameter, n* = 0.01. However, residual 
decay on this sideband leads to heating of the motional mode and may 
thus appear as spurious excitation in the quantum logic detection. A 
few sideband cooling pulses applied immediately before the quan- 
tum logic transfer pulse suppress this effect by returning the crystal 
to its ground state. Off-resonant depumping of the excited state of 
4°ar'* by the °Be’ lasers is negligible because of the narrow natural 
linewidth and the large detuning. Collisional deshelving, as discussed in 
refs. *°°, is absent in this experiment owing to the extremely high vac- 
uum. Furthermore, collisions of an HCI witha neutral particle probably 
lead to charge exchange and total, but inconsequential, ion loss. 


g-factor evaluation 

The *°Ar’*?P,, excited-state g-factor, denoted as 93,7, is determined 
by alinear fit of the Zeeman substate energy shifts. We use the well 
known ground-state g-factor of the clock transition froma recent high- 
accuracy measurement” to operate a co-magnetometer and meas- 
ure the magnetic field by an appropriate combination of the Zeeman 
components. 


The energy shifts AF3/y_m,,) = M3/2,i 83/./{,Bof the Zeeman substates 
of the excited ’P3,. state that are due to an external magnetic field Bare 
obtained fromthe measured Zeeman shifts f, (in units of frequency) of 
the six Zeeman components (iranging from 1 to 6, according to Fig. 4c) 
and analogously the shifts AF, m,_, , of the *P,,, Zeeman substates. h 
and p, are the Planck constant and the Bohr magneton, respectively. 
The shifts are referenced with respect to the degenerate line/level cen- 
tres. One then obtains 


AE3/, m3/2,4 eh AE\/2, my/2, (1) 
h ! h 
H, B H, B 
M3/2,; 83/9 rs =f +My i 81 ae (2) 


Bis eliminated from the above equation by using the four inner Zeeman 
components 2-5, which are less sensitive to magnetic-field fluctua- 
tions than the two outer ones. Components /f, and f, share the common 
excited state m3, = —1/2 (see Fig. 4c), and therefore their difference 
yields the ground-state Zeeman splitting directly, without relying on 
the excited-state g-factor. Using the known ground-state g-factor 21/7 
from the work of Arapoglou et al.”, we obtain the magnetic field 


Ath ~f) 
B= 2 3 
EES (3) 


Similarly, components/, and /, share the excited state with m,,) = +1/2, 
and we acquire a second measurement of 


h(f-f,) 


B, = 
81/2 Hp 


(4) 


Introducing U=f;—f, +f, —f, for simplicity, we average the magnetic 
field B from these two relations to reduce the uncertainty 


_Byt+B,  hU 5) 
2 2 81 Hp 
This expression is inserted into equation (2) to obtain 
83). U U 
J(M3/9 j) = ae, M32 =f +m) > (6) 
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On the right-hand side of the equation, the measured shifts of the 
excited Zeeman substates are given, which fulfil a linear relation in 
Ms; (left-hand side of the equation). A linear fit (see black line in Fig. 4b) 
of the form 


YAM3/2,;) = M3), ,+ D (7) 


with offset b to account for the global frequency offset in the measured 
J, allows us to determine the excited-state g-factor g3,. from the slopea 


285/74 


§32-—y (8) 


The uncertainties o,; of the excited Zeeman substates are obtained 
from the right-hand side of equation (6) by expressing U again as 
U=f,-f,+f;,—fi, followed by standard uncertainty propagation with 
the independently measured f; 


(9) 


The uncertainties 0, of the Zeeman components depend on the statis- 
tical uncertainty of the line centre from the fit, OF (fitting the lines by 
Rabiline shapes) and the relative systematic magnetic- -field uncertainty 
o,/B. The latter is time-dependent and is estimated from the observed 
magnetic-field stability measured previously by using the °Be* qubit 
transition frequency (see ref. * for details) to be 4.1 x 10° (measure- 
ment 1) and 3.2 x 10°° (measurements 2 and 3) on relevant timescales. 


Accordingly, one has 
2 
2, (% 
(Bs) 


The linear fit shown in Fig. 4b is weighted with the o,; uncertainties, 
which are displayed in the lower panel. For completeness, we state the 
fit offsets for the three sets of measurements: b = -17(3) Hz, —45(2) Hz 
and -30(2) Hz. The reduced y’ of the linear fits are 1.45, 0.57 and 0.21. 

To estimate the uncertainty ofg;,,., we replace a and Uin equation (8) 


O,= Of, (10) 


by their analytical expressions. a= (m3, ;y,)/ M3). ; isobtained from 
the closed-form solution of a linear fit. The y, values are given by the 
right-hand side of equation (6), and U=f,—f,+f,-f. We can neglect 
the parts-per-billion uncertainty of the experimental result 
$12 = 0.66364845532(93) from the very recent Penning trap measure- 
ment” because it is more than three orders of magnitude smaller than 
our experimental uncertainties in the 25,000-times-weaker magnetic 
field. Finally, the only uncertainties are introduced by the indepen- 
dently measured f,. Thus, the uncertainty Tg,, is obtained from the 
typical formula of uncertainty propagation 


(11) 


The calculated g3,. values are 1.3322989(19) (56) systr 
1.3322897 (23) cat(43) gysc ANG 1.3322828(24) cat(43) sys For the three meas- 
urement sets obtained on two different days, where we have stated 
the statistical and systematic uncertainties separately. The results are 
shown in Fig. 5 together with recent calculations. The uncertainties of 
the individual measurements are the root of the sum of the squared sta- 
tistical and systematic uncertainties. The measurements agree within 
their uncertainties, and the largest deviation between measurement 
land the weighted average is 1.6 standard deviations. We obtain the 
weighted average 93) =1.3322895(13) s:at(56)yst, Where we have combined 
the statistical uncertainties and stated the largest systematic uncer- 
tainty of the individual measurements as a conservative estimate for 
the systematic uncertainty of the average. 


Data availability 


The datasets generated and analysed during this study are available 
from the corresponding author upon reasonable request. 
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Extended Data Fig. 1| Experimental setup. a, Top view of the setup. The 
apparatus extends over two rooms separated by an acoustically insulating wall. 
Inside the ‘machine room’ onthe right-hand side, HCIs are produced in an 
EBIT” and extracted as ion bunches along the ion beam trajectory (blue line) 
through a deceleration beamline. At the laser laboratory (left side), they are 
axially injected into a cryogenic linear Paul trap™*, which is mounted ona 
pneumatically floating optical table (grey-shaded). The Paul trapis 
refrigerated by a vibrationally decoupled pulse tube cryocooler® located in the 
machine room. The beamline is composed of several ion optical elements: five 
segmented einzel lenses and an electrostatic 90° deflector for guiding and 
focusing the ions, a pair of pulsed drift tubes for deceleration, and six 
cylindrical electrodes arranged in line in front of and behind the Paul trap. 
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Charge-state separation is accomplished by the different times of flight 
through the beamline. One electrode of the third segmented einzel lens is used 
as agate to select the desired charge state. An MCP detector in front of the Paul 
trap includes two fine stainless-steel meshes that apply a well defined retarding 
field, and allows the measurement of the kinetic-energy distribution of the ion 
bunches (see also Extended Data Fig. 2e, f). Asecond MCP detector behind the 
Paul trapis used to optimize the ion beam transmission through the Paul trap. 
b, Magnified side view of the cryogenic Paul trap region. The trap (photograph) 
is shown with the two adjacent electrostatic tubes. The left one (mirror tube) at 
the entrance of the Paul trap is used to capture the HClIs by rapidly switching to 
a confining potential once the HCIs have passed it. Photograph: Physikalisch- 
Technische Bundesanstalt. 
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Extended Data Fig. 2| HCl extraction and transfer. a, Simplified illustration of 
the electrostatic potential used for the *°Ar'* transfer from the EBIT to the Paul 
trap. The entire ion inventory stored inthe EBIT, with its charge-state 
distribution displayed as grey-shaded, is ejected by switching the axial trap toa 
repulsive potential. The charge states separate owing to their distinct initial 
kinetic energies. *°Ar’ ions (red) are selected by an electrode used asa gate 
(not shown). The fast *°Ar* bunch is then slowed down uponentering the 
pulsed drift tubes. Having arrived there at the centre of alinear potential 
gradient, the electrode potentials are rapidly switched to ground, andaslower 
4ar'5* bunch leaves the pulsed drift tubes. At the Paul trap, the ions are further 
decelerated by an electrostatic potential and enter the trapping region witha 
reduced residual kinetic energy of 5g Vto 10q V. They then pass a Coulomb 
crystal of °Be* ions andare reflected by an electrostatic endcap electrode 
biased toa potential of about 12 V above the biased common ground. 
Meanwhile, an electrostatic mirror tube in front of the Paul trap has been 
switched up toa confining potential at which *°Ar'*" is unable to escape the Paul 
trap. This causes an oscillatory motion along the trap axis. Through repeated 
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interactions with the laser-cooled *Be’ ions, *°Ar'* dissipates its residual 
kinetic energy and joins the Coulomb crystal. b, Normalized ion yield asa 
function of the time of flight after ion ejection from the EBIT, measured by the 
first MCP detector in front of the Paul trap. The black curve shows the entire 
charge-state distribution, with Ar charge states from +7 through +15. Using the 
gate electrode, *°Ar'* is chosen for passage, as shown by the red curve. a.u., 
arbitrary units. c, d, Normalized *°Ar* bunches as a function of time and 
position along the beamline axis (averaged over 16 shots). The FWHM of the fast 
bunchis about 250 ns (c) and that of the slow bunchis about 185 ns (d). 

e, f, Normalized kinetic-energy distributions of the *°Ar* bunches along the 
beamline axis: fast bunch (e) and slow bunch after deceleration and phase- 
space cooling using the pulsed drift tubes (f). The red circles show the 
integrated ion yield of an averaged *°Ar™ bunch (16 shots) fora given 
retardation potential, measured by the retarding-field analyser. A Gaussian 
error function (red line) was fitted to the data and differentiated to obtain the 
Gaussian energy distribution (blue line) to show the mean kinetic energy and 
longitudinal energy spread. 
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Extended Data Fig. 3 | Quantum logic-assisted internal state preparation of 
Ar". The m,.=—1/2 state of the?P,,, level is deterministically populated by a 
series of five clock laser sideband -pulses (1-5), which excite the two-ion 
crystal from the motional ground state |0),, (solid lines) into the excited state 
1)» (dashed lines). By means of Raman sideband cooling pulses acting on the 
°Be’ ion, the crystal is returned to the motional ground state after each transfer 
pulse. This ensures unidirectional optical pumping’. To increase the state- 
preparation efficiency, this sequence is repeated four times. The other Zeeman 
ground state (P,/, m,.=+1/2) is prepared in an analogous manner. 
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The discovery of superconductivity at 200 kelvin in the hydrogen sulfide system at 
high pressures’ demonstrated the potential of hydrogen-rich materials as high- 
temperature superconductors. Recent theoretical predictions of rare-earth hydrides 
with hydrogen cages”? and the subsequent synthesis of LaH,, with a superconducting 
critical temperature (7,) of 250 kelvin** have placed these materials on the verge of 
achieving the long-standing goal of room-temperature superconductivity. Electrical 
and X-ray diffraction measurements have revealed a weakly pressure-dependent 7, for 


LaH,, between 137 and 218 gigapascals in a structure that has a face-centred cubic 
arrangement of lanthanum atoms>. Here we show that quantum atomic fluctuations 
stabilize a highly symmetrical Fm3mcrystal structure over this pressure range. The 
structure is consistent with experimental findings and has a very large electron- 
phonon coupling constant of 3.5. Although ab initio classical calculations predict that 
this Fm3mstructure undergoes distortion at pressures below 230 gigapascals””, 
yielding a complex energy landscape, the inclusion of quantum effects suggests that it 
is the true ground-state structure. The agreement between the calculated and 
experimental 7, values further indicates that this phase is responsible for the 
superconductivity observed at 250 kelvin. The relevance of quantum fluctuations 
calls into question many of the crystal structure predictions that have been made for 
hydrides within a classical approach and that currently guide the experimental quest 
for room-temperature superconductivity® ®. Furthermore, we find that quantum 
effects are crucial for the stabilization of solids with high electron-phonon coupling 
constants that could otherwise be destabilized by the large electron-phonon 
interaction’, thus reducing the pressures required for their synthesis. 


The potential of metallic hydrogen as a high-7, superconductor” 
was identified shortly after the development of Bardeen-Cooper- 
Schrieffer theory, which explains superconductivity using the elec- 
tron-phonon coupling mechanism. The main argument in favour of 
metallic hydrogen was that TJ. can be maximized for light compounds 
owing to their high vibrational frequencies. Because high pressures 
are required to metallize hydrogen", chemical precompression with 
heavier atoms!”” was suggested as a pathway by which to decrease the 
pressure needed to reach metallicity and, therefore, superconductivity. 
These ideas have been realised using modern ab initio crystal structure 
prediction methods based on density functional theory (DFT)”*». 
Hundreds of hydrogen-rich compounds have been predicted to be ther- 
modynamically stable at high pressures, and their T. values have been 
estimated by calculating the electron-phonon interaction®’. The suc- 
cess of this co-operation between DFT crystal-structure predictions and 
T, calculations was exemplified by the discovery of superconductivity 


in HS at 200 K""*””. The prospects for discovering hydrogen-based 
high-T, superconductors in the near future are therefore high, with 
rare-earth hydrides with sodalite-like clathrate structures showing 
particular promise’. This is in clear contrast to other high-7, super- 
conducting families such as cuprates or pnictides, in which the lack 
of aclear understanding of the superconducting mechanism hinders 
an in silico-guided approach. 

DFT predictions of the La-H system proposed LaH,, to be thermo- 
dynamically stable against decomposition at high pressures”. A soda- 
lite-type structure with space group Fm3m and T, = 280 K at pressures 
greater than around 220 GPa was suggested as a candidate for high-T, 
superconductivity”’ (Fig. 1). Distorted versions of the Fm3mstructure 
with space group C2/mand arhombohedral lanthanum sublattice were 
also discussed’’, and shortly after the first predictions’, alanthanum 
superhydride was synthesized by heating a lanthanum sample witha 
laser ina hydrogen-rich atmosphere inside a diamond anvil cell (DAC)”. 
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Fig.1| Quantum effects stabilize the symmetric 
Fm3m phase of LaH,).a, Enthalpy as a function of 
pressure for different structures of LaHjo, 
neglecting zero-point energy in the calculations. 
Here, pressure is calculated classically from V(R), 
neglecting quantum effects on V(R). b, Left, a Born- 


1 Oppenheimer energy surface V(R), exemplifying 
the presence of many local minima belonging to 
distorted structures. Rrepresents the positions of 
atoms treated classically as simple points. Right, the 
configurational energy surface E(R), including 
quantum effects. R represents the quantum 
centroid positions, which determine the centre of 
the ionic wave functions—that is, the average atomic 
positions. By including quantum effects, all phases 
collapse toa single phase: the highly symmetric 
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Classical description 


On the basis of the unit cell volume obtained by X-ray diffraction, the 
hydrogen-to-lanthanum ratio was estimated to be between 9 and 12. 
The lanthanum atoms adopted a face-centred cubic (fcc) arrangement 
at pressures greater than about 160 GPa, whereas at lower pressures 
the lattice was rhombohedral with a lanthanum sublattice of the R3m 
space group. Owing to the small X-ray cross-section of hydrogen, it is 
not experimentally possible to resolve the hydrogen sublattice directly. 
More recently, evidence of asuperconducting transition at 260 K and 
188 GPa was reported ina lanthanum superhydride*”’. These findings 
were subsequently confirmed by the measurement ofa 7, of 250 K from 
137 to 218 GPa ina structure with an fcc arrangement of the lanthanum 
atoms, suggesting a LaH,, stoichiometry?. 

Although it is tempting to assign the superconductivity at 250 K to 
the previously predicted Fm3m phase? >, there is a clear problem: the 
Fm3m structure is predicted to be dynamically unstable over the whole 
pressure range in whicha 250 K 7, has been observed. This would imply 
that the Fm3m phase is not a minimum of the Born-Oppenheimer 
energy surface, and consequently a [, has not been estimated for this 
phase in the experimental pressure range. The contradiction between 
the observation of superconductivity and the predicted instability of 
the Fm3m phase may indicate a problem with the classical treatment 
of the atomic vibrations in the calculations. Considering that quantum 
proton fluctuations symmetrize hydrogen bonds inthe high-pressure 
X phase of ice” and in H,S””’, a similar situation is expected in LaH,o. 
Here we show how quantum atomic fluctuations completely reshape 
the energy landscape by removing classical local minima, rendering 
the Fm3m phase the true ground state and the state responsible for 
the observed superconducting critical temperature. 

We start by using DFT to calculate the lowest-enthalpy structures of 
LaH,, asa function of pressure, using state-of-the-art methods for the 
prediction of crystal structure”*. The contribution associated with 
atomic fluctuations is not included, so that the energy corresponds 
solely to the Born—Oppenheimer energy V(R), where Rrepresents the 
position of atoms treated classically as simple points. As shown in Fig. 1, 
different distorted phases of LaH,, are thermodynamically more stable 


: Fm3m. 


With quantum effects 


than the Fm3m phase. At pressures greater than about 250 GPa, all 
phases merge tothe fm3m symmetric phase. These results are in agree- 
ment with previous calculations’, even though we identify other pos- 
sible distorted structures with lower enthalpy—such as the R3m, C2 
and P1 (not shown) phases. These phases feature distortion not only 
inthe position of the hydrogen atoms but also in the lanthanum sublat- 
tice, leading to anon-fcc arrangement that should be detectable by 
X-ray analysis (see Extended Data Fig. 1). The fact that many structures 
are predicted emphasizes that the classical V(R) energy surface has a 
multifunnel structure that is tractable to many different saddle and 
local minima, as shown in Fig. 1. 

This picture completely changes when we include the energy of 
quantum atomic fluctuations—the zero-point energy. We calculate the 
zero-point energy within the stochastic self-consistent harmonic 
approximation (SSCHA)**”*. The SSCHA is a variational method that 
calculates the energy of the system (E(R)) including atomic quantum 
fluctuations as a function of the centroid positions R, which determine 
the centre of the ionic wave functions. The calculations are performed 
without approximating V(R), keeping all of its anharmonic terms. We 
performa minimization of E(R) and determine the centroid positions 
at its minimum. By calculating the stress tensor from E(R) (ref. 7°), 
we relax the lattice parameters in order to find structures with isotropic 
stress conditions considering quantum effects. We start the quantum 
relaxation for both R3m and C2 phases with the lattice that yields a 
classical isotropic pressure of 150 GPa and vanishing classical forces— 
that is, calculated from V(R). All quantum relaxations quickly evolve 
into the Fm3mphase (Extended Data Fig. 4). This suggests that the quan- 
tum energy (E(R)) landscape is much simpler than the classical V(R) 
landscape, as shown in Fig. 1, and that the ground state of LaH,) over 
the pressure range of interest is the Fm3m phase with sodalite-type 
symmetry. The quantum effects are substantial, reshaping the 
energy landscape and stabilizing structures by more than 60 meV 
per LaH,o. 

Our results further confirm that the Fm3m phase of LaH,, is respon- 
sible for the superconductivity at 250 K. This is consistent with the fcc 
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Fig. 2 |The phonon band structure of Fm3m LaH,, at different pressures. The 
harmonic phonons showlarge instabilities in several regions of the Brillouin 
zone. Only at the high-pressure limit—for example, at pressures greater than 
220-250 GPa— is dynamic (harmonic) stabilization reached. The anharmonic 
phonons obtained from the Hessian of the quantum energy F(R) within the 
SSCHA are dynamically stable over the experimentally relevant pressure range. 


arrangement of lanthanum atoms that is found experimentally*. We 
verified that the experimental resolution reported? is sufficient to 
discard the classically obtained distorted structures (Extended Data 
Fig. 1). However, another study” observed arhombohedral distortion 
at pressures lower than about 160 GPa, in which the lanthanum sublat- 
tice occupied the R3m space group and the rhombohedral angle was 
approximately 61.3° (c/a = 2.38 in the hexagonal representation). Our 
calculations show that this distortion is compatible with the hypoth- 
esis of slight anisotropic stress, which could be present in some exper- 
iments inside the DAC. Indeed, by performing an SSCHA minimization 
for the R3m phase but keeping the rhombohedral angle fixed at 62.3° 
(the value that yields an isotropic pressure of 150 GPa at the classical 
level), the quantum stress tensor shows a 6% anisotropy between the 
diagonal direction and the perpendicular plane. This suggests that 
anisotropic conditions inside the DAC can produce the R3m phase, 
although we cannot rule out the possibility that other experimental 
stress conditions could favour other crystalline phases. 

The phonon spectra of the fm3m phase, calculated in the harmonic 
approximation from the Hessian of V(R), show clear phonon insta- 
bilities ina broad region of the Brillouin zone (Fig. 2). These instabilities 
appear at pressures lower than about 230 GPa, which is consistent with 
the finding that many possible atomic distortions lower the enthalpy 
of this composition at these pressures. Conversely, when the calcula- 
tionis performed using the Hessian of F(R) (ref.”°)—which effectively 
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In the case of deuterium, an instability develops at low pressures (126 GPa), 
whichis consistent with experimental evidence. The pressure stated 
corresponds to that calculated from F(R), which considers quantum effects. 
The grey shading marks the regions with imaginary phonon frequencies, which 
are depicted as negative frequencies. 


captures the full anharmonicity of V(R)—no phonon instability is 
observed (Fig. 2). This again confirms that the Fm3m phase is a mini- 
mum inthe quantum-energy landscape over the whole pressure range 
in which superconductivity at 250 K was observed. Whereas the Fm3m 
phase of LaH,) remains a minimum of F(R) at pressures as low as 129 
GPa, instabilities are seen at 126 GPa in the case of LaD,o. This implies 
that—at this pressure—the Fm3m phase of LaD,, distorts to a new 
phase, as has been suggested previously”. 

The breakdown of the classical harmonic approximation for phonons 
hinders the estimation of T, at pressures lower than around 230 GPain 
the Fm3m phase. It also calls into question the certainty of harmonic 
calculations at higher pressures”, considering that large anharmonic 
effects are persistent well above 260 GPa (Fig. 2). However, with anhar- 
monic phonons derived from the Hessian of E(R), we can readily cal- 
culate the electron-phonon interaction and the superconducting T, 
over the experimental pressure range (137-218 GPa). T, is estimated 
fully ab initio—without any empirical parameter—by solving Migdal- 
Eliashberg equations and applying superconducting DFT (SCDFT). As 
shownin Fig. 3, the numerical solutions of Migdal-Eliashberg equations 
with an anisotropic energy gap match well with the experimental values. 
The values obtained from SCDFT calculations systematically show a 
slightly lower 7,. Our reported values of 7, provide evidence for the 
phonon-driven mechanism of superconductivity, and confirm that the 
Fm3m structure of LaH,, is responsible for what is, to our knowledge, 
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Fig.3|Summary of experimental and theoretical 7, values. Superconducting 
critical temperatures calculated using anisotropic Migdal-Eliashberg 
equations and SCDFT. In both cases the anharmonic phonons obtained with 
the SSCHA were used. The results are compared with the experimental values 
measured in refs. *>. The error bars inthe data from ref.° correspond to the 
experimental uncertainty in the determination of T.. 


the highest 7, reported to date. Our calculations for LaD,, inthe fm3m 
phase are also in agreement with the experimental data point reported. 
Despite the presence of large anharmonic effects, the isotope coeffi- 
cient a=—[In7(LaD,,) — In7,(LaH,9)]/In2 is close to 0.5 (0.43 at around 
160 GPa)—as expected by Bardeen-Cooper-Schrieffer theory—and is 
in agreement with the experimentally reported value of a= 0.46. 

Finally, we also calculated 7, in the presence of the subtle rhombo- 
hedral distortion that can be induced in experiments by anisotropic 
pressure conditions. When the rhombohedral angle is fixed at 62.3°, 
the 7, obtained for the R3m phase at 160 GPa is 9% lower than for the 
Fm3m phase. The observed weak pressure dependence of 7, is there- 
fore consistent with the absence of arhombohedral distortion, as 
suggested by the X-ray data’. However, as argued above, undesired 
anisotropic stress conditions in the DAC can induce phase transitions. 
For cases in which measurements of 7, have yielded lower values 
(around 200K), itis highly probable that the corresponding structures 
are distorted as a result of anisotropic pressure conditions. We can 
also safely rule out the possibility that compositions such as LaH,,— 
which are proposed to havea high superconducting critical tempera- 
ture°—are responsible for T. values in the range observed here 
(Extended Data Fig. 8). 

In summary, this work demonstrates the importance of quantum 
effects in determining the ground-state structures of superconduct- 
ing hydrides—challenging current predictions and evidencing flaws 
in standard theoretical methods. Similar effects are expected in other 
high-7, compounds with hydrogen clathrate structures, for which 
syntheses have recently been reported”* *°. We also illustrated that 
quantum fluctuations are essential in order to sustain crystals with 
large electron-phonon coupling constants (the value of 3.6, found 
at 129 GPa for LaH,,, is to our knowledge the highest reported); such 
structures would otherwise be destabilized by the substantial elec- 
tron-phonon interaction, resulting in distorted (low-symmetry) 
structures in which the electronic density of states at the Fermi level 
is reduced? (Extended Data Fig. 7). Our results may therefore help 
to increase the prospect of attaining high- 7, superconductivity in 
hydrogen-based structures at much lower pressures than would be 
expected classically. 
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Methods 


Calculation details 

First-principles calculations were performed within DFT and the gen- 
eralized gradient approximation (GGA) as parametrized by Perdew, 
Burke and Ernzerhof (PBE)*. Harmonic phonon frequencies were 
calculated within density functional perturbation theory (DFPT)” 
making use of the Quantum ESPRESSO code®**. The SSCHA>7®936 
minimization requires the calculation of energies, forces and stress 
tensors in supercells. These were also calculated within DFT at the 
PBE level with Quantum ESPRESSO. For the final SSCHA populations, 
1,000 configurations were used to reduce the stochastic noise. In all 
calculations we used ultrasoft pseudopotentials including 11 electrons 
for the La atoms, a plane-wave cut-off energy of 50 Ry for the kinetic 
energy and 500 Ry for the charge density. 

In the harmonic phonon calculations for the Fm3m and the R3m 
phases, we used the primitive and rhombohedral lattices, respectively, 
with one LaH,, formula unit in the unit cell. A20 x 20 x 20 Monkhorst- 
Pack shifted electron-momentum grid was used for these calculations 
with a Methfessel-Paxton smearing of 0.02 Ry. The DFT calculations 
performed for the SSCHA on supercells were performed ona coarser 
electron-momentum grid, which would correspond to a 12 x 12 x 12 
grid inthe unit cell. We explicitly verified that this coarser mesh yields 
afully converged SSCHA gradient with respect to the electron-momen- 
tum grid, thus not affecting the SSCHA minimization. The DFT super- 
cell calculations for the SSCHA minimization on the C2 phase were 
performed keeping the same k-point density. 

All phonon frequencies for q-points that were not commensurate 
with the supercell used in the SSCHA minimization were obtained by 
directly Fourier-interpolating the real space force constants obtained 
in this supercell, which are calculated form the Hessian of F(R). For 
the Fm3m phase, the SSCHA calculation was performed both ona 
2x2x2andona3 x3 x3 supercell containing, respectively, 88 and 297 
atoms. The phonon spectra shown in Fig. 2 for the fm3m phase were 
obtained by directly Fourier-interpolating the SSCHA energy Hessian 
force constants obtained ina3 x3 x3 supercell. In Extended Data Fig. 2 
we show that the phonon spectrum obtained by directly interpolating 
the force constants in a2 x 2 x 2supercell yields similar results, indicat- 
ing that the energy Hessian force constants are short-range and can 
be Fourier-interpolated. Indeed, the 7, calculated with the 2 x 2 x 2and 
3 x 3 x 3 force constants for interpolating phonons differs by 
only around 3 K. Because the value of estimated T, only negligibly 
depends on the cell size for the Fm3m phase, the SSCHA quantum 
structural relaxations in the R3m and C2 phases were performed in 
2x2x2 supercells with 88 atoms. 

As shown in ref.”°, the Hessian of E(R) is 


0E(R) @ ( @ )'® 
= - 1 
wae TTC AE OA @ (1) 


Bold notation represents matrices and tensors in compact notation. 
In equation (1), @ are the variational force constants of the SSCHA 
(n) 


minimization, ® the quantum statistical averages taken with the SSCHA 
density matrix of the nth order derivatives of VR), and A a tensor that 
depends onthetemperature and ®. lis the identity matrix. As we show 


in Extended Data Fig. 2, setting ® =O has anegligible effect onthe pho: 


nons obtained from the Hessian defined in equation (1). Therefore, ® 
isneglected throughout, and all superconductivity calculations inthe 
Fm3m and R3m phases are performed making use of the phonon fre- 
quencies and polarization vectors obtained from the Hessian of E(R) 


with ® = 0. We also estimated 7, with the phonon frequencies and 
polarization vectors obtained instead from @, resulting in a critical 
temperature 12 K lower within the Allen—Dynes-modified McMillan 


equation. This difference is small and within the uncertainty of the 7, 
calculation between SCDFT and anisotropic Migdal-Eliashberg calcu- 
lations (see Fig. 3 and below). 

The Eliashberg spectral function, which we used for the 7, calcula- 
tions, is defined as 


1 2 
a?F(w) = — v 6(e,,-F, 
to) Ne. nha Srncmterg) OC nk~ Fr) (2) 


X6(Emk+g— Ep) O(@ — Way) 


where Nr, is the electronic density of states (DOS) at the Fermi energy 
(E,), nand mare band indices, kis a crystal momentum, €,, is a band 
energy, Wg, is the phonon frequency of mode vat wavevector q, and 
Bria’ the electron-phonon matrix element between a state 
nk and mk + q. We calculated a?F(w) combining the SSCHA phonon 
frequencies and polarization vectors obtained from the Hessian of 
E(R)with the electron-phonon matrix elements calculated with DFPT. 
For the fm3mand R3mphases, the electron-phonon matrix elements 
were calculated in a 6 x 6 x 6 q-point grid and a 40 x 40 x 40 k-point 
grid. These were combined with the SSCHA phonons and polarization 
vectors obtained by Fourier interpolation to the 6 x 6 x 6 q-point grid 
from the real space force constants coming from the Hessian of E(R) 
ina3x3x3 supercell for the fm3m phase and ina 2 x 2 x 2 supercell for 
the R3mphase. The Dirac deltas on the band energies are estimated by 
substituting them with a Gaussian of width 0.004 Ry. The calculated 
a’ F(w) functions for the Fm3mphase are shown in Extended Data Fig. 3, 
and in Extended Data Fig. 5 we show the results for the R3m phase. 


Crystal phase diagram exploration 

To sample the enthalpy landscape of LaH,, we used the minima-hopping 
method”, which has been successfully employed for global geom- 
etry optimization in a large variety of applications—including super- 
conducting materials such as H,S, PH;, and elemental solids at high 
pressure*® *°. This composition was thoroughly explored with 1, 2, 3 
and 4 formula unit simulation cells. Variable composition simulations 
were also performed for other La-H compositions. Energy, atomic 
forces and stresses were evaluated at the DFT level with the GGA-PBE 
parametrization to the exchange-correlation functional. A plane wave 
basis-set with a high cut-off energy of 900 eV was used to expand the 
wave function together with the projector-augmented wave method 
as implemented in the Vienna Ab initio Simulation Package (VASP)*!. 
Geometry relaxations were performed with tight convergence criteria 
such that the forces on the atoms were less than 2 meV A‘ and the 
stresses were less than 0.1eV A®. Extended Data Fig. 1 shows our calcu- 
lated convex hull of enthalpy formation without considering the zero- 
point energy at 100, 150 and 200 GPa. Notably, there are many stable 
compositions in the convex hull. LaH,, becomes enthalpically stable 
(classically) at around 175 GPa and remains in the convex well above 
300 GPa. We have verified that LaH,, (Fm3m) does not decompose 
into LaH; (Cmcm) and LaH,, (P4/nmm) by around 0.3 eV per LaH,, once 
quantum effects are included in the calculation of the enthalpy forma- 
tion at 150 GPa, contrary to the conclusion drawn by classical calcula- 
tions in ref. . Below 150 GPa, R3mand C2 phases (LaH,,) show unstable 
harmonic phonons at I, becoming saddle points of V(R). However, 
harmonically one can find P1 stable structures (decreasing symmetry) 
by following the instability pattern (softening direction—thatis, along 
eigenvector polarization). Pl structures are degenerate in enthalpy 
within less than 3 meV per LaH,, with respect to C2. We therefore used 
the C2 as arepresentative of highly distorted structures for our study. 


Superconductivity calculations in the Fm3m phase 
Superconductivity calculations were performed within two differ- 
ent approaches that represent the state-of-the-art of ab initio super- 
conductivity: SCDFT and the Eliashberg equations with full Coulomb 
interaction. 


SCDFT is an extension to DFT for asuperconducting ground state”. 


By assuming that the nk anisotropy in the electron-phonon coupling 
is negligible (see ref. * for further details), the critical temperature is 
computed by solving an (isotropic) equation for the Kohn-Sham gap: 


tanh lea A | 


, (3) 
Fe) As) 


A,(€) = Z(€)A,(€) - fde’K(e, €’) 
where cis the electron energy and fthe inverse temperature. The kernels 
Kand Zcome fromthe exchange correlation functional of the theory? ® 
and depend on the properties of the pairing interactions: electron- 
phonon coupling and screened electron-electron repulsion. Equation (3) 
enables us to calculate 7, completely ab initio, without introducing an 
empirical u* parameter (Coulomb pseudopotential). Dynamic 
effects on the Coulomb interaction (plasmon) were also tested and did 
not show any substantial effect. In its isotropic form, the screened 
Coulomb interaction in SCDFT is accounted for by a function p(e, €’), 
which is given by the average” random phase approximation (RPA) 
Coulomb matrix element on the iso-energy surfaces € and €’ times the 
DOS ate’ (Me’)): 


ppa O(E- Ent) 


He, e’)= Y [PPK Ve NG.) 


O(E’ - Em) (4) 


The full energy dependence of the DOS is accounted for in the calcu- 
lations, whereas the electron-phonon coupling is described by the 
a’F(w) of equation (2). 

The second approach we use to simulate the superconducting state 
is the anisotropic Eliashberg approach”’. Here we include, together 
with the energy dependence of the electron DOS, the anisotropy of the 
electron-phonon coupling. The Green’s function form of the Eliashberg 
equation we solve is given as 


P 1 ‘ : ; 
Zn i@;) = NB x vena, 10) G+ qli@, + i@;) (5) 
gm 


: 1 F ; 
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Here, 2,,(i@;) and 4,,(i@;) are the normal and anomalous self energy, 


and VP" (q, iw,) and VC (q, iw,) are the k-averaged phonon-mediated 


interaction and Coulomb interaction, respectively. The explicit form 
of VP, (q, i@,) is given as 


Vinn(@, i@,) => lg (@)?D,(q, io,) (7) 


where Ig’ _(q)I? isak-averaged electron-phonon matrix element 


Yk 
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ev,,(a)| = 


and D,(q, iw,) is a free-phonon Green’s function, 
D,(q, i@,) = - 2Wgy/(w}, + Way) The electron-phonon matrix elements 
are calculated through a DFPT calculation with 6 x 6 x 6 q-point grid 
and are combined with the phonon frequencies and polarization vec- 
tors obtained by directly Fourier-interpolating to this grid the force 
constants arising from the E(R) Hessian in the 3 x 3 x 3 supercell. For 
the Coulomb interaction, VS,,.(q, i@,) is approximated by k-averaged 
static Coulomb interaction within the random phase approximation, 
Nc Le Vie nk+q(i@, = 0) . Using equation (5), the Dyson equation was 
solved self-consistently and then equation (6) was solved to estimate 
T, with 36 x 36 x 36 k-point grid and 512 Matsubara frequencies. 


In Extended Data Table 1 we summarize all calculated 7, values 
within anisotropic ME and isotropic SCDFT. We also include the val- 
ues obtained with the McMillan equation and the Allen-Dynes-mod- 
ified McMillan equation (u* = 0.1). The calculated electron-phonon 
coupling constant, A = Zi dwa?F(w)/w@ and the logarithmic frequency 


AVEFAKE, dog = eXD|j in dy Fo loge } are also included inthe table. 


Quantum structural relaxations in the R3mand C2 phases 

In Extended Data Fig. 5 we show the evolution of the pressure calculated 
along the different Cartesian directions for the R3m throughout the 
SSCHA minimization but keeping the rhombohedral angle fixed at 
62.3°. Thus, the centroid positions R are optimized only considering 
the internal degrees of freedom of the R3m phase. Evenif, at the clas- 
sical level, the stress is isotropic (within a 0.5%), after the SSCHA quan- 
tum relaxation an anisotropic stress of a 6% is created between the z 
andx-y directions. The phonons obtained at the end of the minimiza- 
tion are shown in Extended Data Fig. 5. Second, in Extended Data Fig. 4, 
we showthat starting from the result of this minimization but now also 
relaxing the lattice, the R3m phase evolves into the Fm3m phase. It is 
clear how the pressure calculated with quantum effects becomes iso- 
tropic when the rhombohedral angle becomes 60°, the angle corre- 
sponding to an fcc lattice in a rhombohedral description. It is also 
evident that the Wyckoff positions of the R3mphase evolve clearly into 
the Fm3m Wyckoff positions, which are summarized in Extended Data 
Table 2. 

In Extended Data Fig. 6 we show the evolution of the diagonal compo- 
nents of the pressure along the three different Cartesian directions for 
the monoclininc C2 when the lattice structure is relaxed with the SSCHA. 
The starting point is obtained by first performing a SSCHA relaxation 
of only internal atomic coordinates, keeping the lattice parameters 
that yield anisotropic stress of 150 GPa. Itis clear that quantum effects 
create an anisotropic stress if the lattice parameters are not modified. 
When the quantum relaxation of the lattice is performed, the lattice 
parameters are modified and an isotropic stress is recovered. 

Extended Data Fig. 7 shows the structures of the R3mand C2 phases 
obtained classically and after the quantum SSCHA relaxation. After 
the quantum relaxation, the symmetry of both structures is recognized 
as Fm3mwitha tolerance of 0.001 A for lattice vectors and 0.005 A for 
ionic positions, consistent with the stochastic accuracy of the SSCHA. 
Inthesame figure, the electronic DOS as a function of pressure is plot- 
ted. A highly symmetric motif (Fm3m) maximizes Ne» whereas in dis- 
torted structures (R3m and C2) the occupation at the Fermi level is 
reduced by more than 20%. This underlines that the classical distortions 
would lower N;., reducing A, as expected ina system that is destabilized 
by the electron-phonon interaction. 


Transition temperatures from other La-H compositions 
Different compositions onthe La-H phase diagram have been reported 
to be thermodynamically stable. Presumably, the stabilization of these 
compositions and the measurement of different T, values (see ref. *) 
demonstrate that other stoichiometries are responsible for these meas- 
ured T. values. Notably, these 7, values appear substantially lower—for 
instance, the values decrease from 250 K, to 215 K, 110 K and to 70K. 
Experimentally there is not aclear correlation between sample prepara- 
tion, 7, and pressure. In the sample preparation in ref.°, pressures can 
vary from 100 to 200 GPa (gradient inside the DAC) and it was proposed 
that other stoichiometries (low-hydrogen content) are responsible for 
systematically lower values of T,. 

Conversely, ina later publication, the same authors suggested that 
another hydrogen-rich system that is enthalpically competitive (LaH,,) 
could possibly be responsible for other high-T, values. In order to verify 
this hypothesis, we considered structure-prediction runs with this 
stoichiometry, and found crystalline structures that were previously 
reported in ref. 3. Extended Data Fig. 8 shows the structural motif and 


Article 


the corresponding phonons and a’F(w) spectral function. We can rule 
out the possibility that high 7, values, as measured in different sam- 
ples, arise from LaH,, in its P4/nmm (129) structure (lowest enthalpy 
structure for this composition at relevant experimental pressures). AS 
seen in Extended Data Fig. 8, this phase has a strong molecular-crystal 
character, composed of H, units weakly interacting with the La lattice. 
This phase is indeed a poor metal—with low occupation of electrons at 
the Fermi level—owing to its molecular character, and it cannot explain 
T, values of 70 K or higher. Our estimated 7, with the Allen—Dynes for- 
mula, harmonic phonons and using a* = 0.1is 7 K at 100 GPa, reaching 
around 24 K at 200 GPa. More importantly, this phase does not show 
marked anharmonicity. 


Data availability 


All the data generated in this work is available upon request from I.E. 
andJ.A.F.-L. 


Code availability 


Quantum ESPRESSO is an open-source suite of computational tools 
available at https://www.quantum-espresso.org. VASP is a proprietary 
program. The SSCHA and the SCDFT codes are private codes developed 
by some of the authors, and are being prepared for distribution as an 
open-source code. 
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Extended Data Fig. 1| Convex hull of enthalpy formation and diffraction 
pattern of candidate LaH,, phases. Top, classical calculations of enthalpy 
(without zero-point energy) at different hydrogen contents at 100, 150 and 200 
GPa. At low pressure (100 GPa), LaH,) is not stable and only develops as stable 
point in the convex hull of enthalpy formation at pressures above about 175 
GPa. Bottom, diffraction patterns of different structures at 150 GPa (classical 
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pressure), compared to the experimental data reported in ref. ° for LaH,, inthe 
Fm3m phase at 150 GPa. The pattern is shown in the vicinity of the (111) peak of 
the Fm3m phase. This peak is clearly split in the distorted C2 and R3m phases 
predicted classically. The figure provides confirmation that the experimental 
resolution in ref.° would have been sufficient to distinguish between these 
phases. 
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force constants obtained from the Hessian of E(R)inarealspace2x2x2anda phonon spectra correspond to that arising directly from the SSCHA variational 
3x 3x3 supercell. The similarity of both phonon spectra obtained by Fourier 
interpolation indicates that these SSCHA force constants are short-ranged and 
can be Fourier-interpolated. Right, phonon spectra obtained from the SSCHA 
energy Hessian of equation (1), making different level of approximations. The 
purple solid line is the phonon spectrum calculated with the full-energy 


(3) 
force constants @. These results clearly show that whereas the effect of ® is 


important, setting ® =O has minimal effect. All phonon spectra are obtained 
by directly Fourier-interpolating the real space anharmonic force constantsina 
2x2x2supercell. 
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Extended Data Fig. 3 | a?F(w) values for the Fm3m phase of LaH,, and LaD,9. 
Calculated a?F(w) values for different pressures together with the integrated 
electron-phonon coupling constant, which is defined as A(w) = 255 ddQa?F(Q)/O. 
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responsible for the large value of the electron-phonon coupling constant /. It is 
worth noting that acoustic modes with La character contribute between 0.2 and 
0.5toA and cannot be neglected when aiming to estimate an accurate value of 7.. 
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Extended Data Fig. 4 | Details of the R3m LaH,, cell relaxation, including 
quantum effects. The initial point for the relaxation is the output from the 
previous internal relaxation with fixed angle presented in Extended Data Fig. S. 
The R3m phase in the rhombohedral description is described by three vectors 
of the same length (a= b=c) and by the angles between them (a=8=y). Thetop 
left panel shows the evolution of the rhombohedral angle and the top right 
panel shows the evolution of the rhombohedral lattice parameter (a=b=c). 
The progression of the stress tensor in the quantum SSCHA minimization is 
shown in the bottom left panel. It is clear that at the end of the minimization the 
structure has an angle of 60°, which matches the angle of an fcc lattice and, in 
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this case, the stress is isotropic. Inthe bottom right panel, we show the 
evolution of the Wyckoff positions in the minimization and we compare it with 
that of the Fm3mphase. The occupied Wyckoff positions for both R3mLaH,, 
and Fm3mLaH,, are summarized in Extended Data Table 2. Here, the evolution 
of €,, €, €,and €, parameters in the minimization can be seen. The atoms inthe 
first set of 6c positions approach the 8c Wyckoff site of the Fm3mphase, 
whereas the atoms in the second set of 6c positions and those in 18h sites 
approach the atoms in the 32fWyckoff site of the Fm3m phase, where 
€=0.12053. 
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Extended Data Fig. 5 | Phonon dispersion in R3m-phase LaH,, and the 
anisotropic pressure created ina fixed-cell quantum relaxation. Left, 
harmonicand anharmonic phonon spectrum, maintaining a 62.3° 
rhombohedral angle. The harmonic calculation is performed with the internal 
atomic positions that yield classical vanishing forces. The anharmonic 
calculation is performed after relaxing (with the SSCHA) the internal degrees 
of freedom but maintaining the 62.3° rhombohedral angle. At the harmonic 
level there are unstable phonon modes even at I. Symmetry prevents the 
relaxation of this structure according to the unstable phonon modeatI. The 
harmonic phononsare calculated at aclassic pressure of 150 GPa. Quantum 
effects add around an extra 10 GPato the pressure. To the right of the graphis 
shown the behaviour of A(w) and a?F(w) for the anharmonic calculation. Right, 
pressure along the different Cartesian directions during the SSCHA relaxation 
of the internal parameters, keeping the rhombohedral angle fixed at 62.3°. At 
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step Othe pressure reported is obtained directly from V(R), neglecting 
quantum effects. It is isotropic within 1 GPa of difference between the x-yandz 
directions. At each of the other steps it is calculated from the quantum E(R) 
and along the minimization it becomes anisotropic. When the minimization 
stops at step 12—thatis, the internal coordinates are at the minimum of the 
E(R) for this lattice—the stress anisotropy between thezand thex-y directions 
is about 6%. This clearly indicates that quantum effects act to relax the crystal 
lattice—in particular, because P, is larger—by reducing the rhombohedral angle. 
Itis worth noting that quantum effects increase the total pressure by 
approximately 10 GPa, whichis calculated as P= (P,+ P, + P,)/3. The initial cell 
parameters before the minimization are a=3.5473398 A and a= 62.34158°. The 
initial values of the free Wyckoff parameters, which yield classical vanishing 
forces and a150-GPaisotropic stress, are €,= 0.26043, €,=0.09950, 

€,= 0.10746 and €, = 0.12810. See Extended Data Table 2 for more details. 
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stress tensor is recovered. Asymmetry analysis performed on the structure at 
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electronic DOS at different pressures is plotted for each structure (for different pressures. 
comparison, at the same energy scale). The highly symmetric motif (Fm3m) 
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LaH,,: it is dynamically stable. Bottom right, superconducting Eliashberg 


Extended Data Table 1| Summary of calculated T, values 


System | Pressure (GPa)} A |wiog (meV) Bune os (K) To 6.1 (K)| TMZ (K) (TEC? (kK) 


LaHio 129 3.62 76.4 171.8 252.6 255.3 230 
LaHio 163 2.67 96.4 190.4 247.0 242.8 225 
LaHio 214 2.06 115.5 196.3 235.9 237.9 210 
LaHio 264 1.73 126.6 189.5 219.2 216.9 201 
LaDio 159 3.14 63.5 135.0 184.2 180.4 171 
LaD1o 210 2.21 81.7 145.5 176.5 172.9 158 
LaDio 260 1.80 92.2 142.2 164.6 157.9 151 


Values are calculated using different approaches ranging from empirical to fully ab initio: McMillan equation (EEA); Allen-Dynes-modified McMillan equation CPs anisotropic treatment of 
Migdal-Eliashberg (TME,) and SCDFT (T3°F"). Values of A and @,,, are also given. 
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Extended Data Table 2 | Details of the crystal structures 


Fm3m (C) Fm3m (R) R3m (R) 
1 La 4b [5,5,3]|1 La 4b [5,3,5]|1 La 3b (5,5, 5] 
2H 8c [+,4,4] 2H8c (+, 4, 4] 2H 6c [€a,€a,€a| 
[to] [3.74] [-€a -€a,-€a 


8H 32f [eee] |8 H 32£ [eee] [2H 6c [e,e,,6] 


|-€,-€,-€] |-€,-€,-€] |-€,-€b,-€6| 
|e,€,-€| |-e,-€,3e] |6 H 18h |-ex,-e2,3€y| 
le,-€,€| |-€,3¢,-e| |-€y ,3€x -Ex| 
|-€,€,€| [3e,-€,-e| [Bey .-€x -Ex| 
|-,-€,€| [e,€,-3e| |éx ,€x ,-3€y| 
|-€,€,-€| [e,-3e,€| |€x ,-3€y ,€x| 
e,-€,-e| |-3e,€,€| |-3€y ,€x,€x| 
Composition (Space group) Lattice parameters Wyckoff positions 
LaHio (Immm) a = 3.58303 A La 2c_ [0.50000, 0.50000, 0.00000] 
b= 3.61834 A H 8m (0.75841, 0.00000, 0.11649] 
c = 5.08749 A H 81 (0.00000, 0.75742, 0.87548] 
H 4j  [0.50000, 0.00000, 0.74572] 
LaHjo (C2) a = 6.15468 A La 4c_ [0.49244, 0.00070, 0.25292] 
b = 3.60628 A H 4c 0.13978, 0.24567, -0.05243| 
c = 7.23776 A H 4c {0.09798, 0.24122, 0.45027} 
B = 55.71434° H 4c (0.36015, 0.25590, 0.05238] 
H 4c [0.40204, 0.26021, 0.54971] 
H 4c [-0.09751, 0.00051, -0.05100] 
H 4c [0.86810, 0.00071, 0.43706] 
H 4c_ [0.88713, 0.00076, 0.69398] 
H 4c 0.87083, 0.00068, 0.19089] 
H 4c_ [0.73058, 0.00043, 0.88088] 
H 4c [0.76156, 0.00071, 0.36763] 
LaHi (P4/nmm) a = 3.87435 A La 2c [0.25000, 0.25000, 0.78577] 
b = 3.87435 A H 4e {0.00000, 0.00000, 0.50000} 
c = 5.27636 A H 8i 0.25000, -0.02052, 0.17824] 
H 8i 0.25000, 0.55418, 0.35160} 
H 2a _ 0.75000, 0.25000, 0.00000] 


Top, the table summarizes the occupied Wyckoff positions for different structures found with the minima hopping method and used in SSCHA for minimization. We describe the Wyckoff 
positions using crystal coordinates, so that the [x, y, z] coordinate should be understood as an xa + yb + ze atomic position with a, b, ¢ the lattice vectors. For the R3m phase we use the 
rhombohedral lattice (R), where the three lattice vectors have the same length (a=b =c) and the angle between them is the same (a = = y). The Fm3m phase is described both in this 
rhombohedral description (R) and, for comparison, in the standard cubic conventional lattice (C). In the Fm3m phase the lanthanum atom is described by the 4b sites, two hydrogen atoms 
occupy the 8c sites, and the remaining eight hydrogen atoms occupy the 32f sites. Most of the atomic positions are fixed by symmetry, and overall the Fm3m structure can be described by one 
single free parameter (s). In the R3m phase the lanthanum atom is locked in the 3b sites, two pairs of hydrogen atoms occupy the 6c sites and the remaining six hydrogen atoms occupy the 18h 
sites. In this case symmetry allows for more freedom and overall the structure of the R3m phase can be described by four free parameters (€,, €,, €, and €,). The bottom table shows lattice 
parameters and atomic coordinates for LaH,, (Immm) and LaHjg (C2) at 150 GPa and LaH,, P4/nmm at 100 GPa. These pressures are estimated classically. The positions below give vanishing 
forces at classical level. 
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Spin dynamics in antiferromagnets has much shorter timescales thanin 
ferromagnets, offering attractive properties for potential applications in ultrafast 
devices’ *. However, spin-current generation via antiferromagnetic resonance and 
simultaneous electrical detection by the inverse spin Hall effect in heavy metals have 
not yet been explicitly demonstrated* ®. Here we report sub-terahertz spin pumping 
in heterostructures of a uniaxial antiferromagnetic Cr,O; crystal anda heavy metal (Pt 
or Tainits B phase). At 0.240 terahertz, the antiferromagnetic resonance in Cr,O, 
occurs at about 2.7 tesla, which excites only right-handed magnons. In the spin- 
canting state, another resonance occurs at 10.5 tesla from the precession of induced 
magnetic moments. Both resonances generate pure spin currents in the 
heterostructures, which are detected by the heavy metal as peaks or dips in the open- 
circuit voltage. The pure-spin-current nature of the electrically detected signals is 
unambiguously confirmed by the reversal of the voltage polarity observed under two 
conditions: when switching the detector metal from Pt to Ta, reversing the sign of the 


spin Hall angle’ °, and when flipping the magnetic-field direction, reversing the 
magnon chirality**. The temperature dependence of the electrical signals at both 
resonances suggests that the spin current contains both coherent and incoherent 
magnon contributions, which is further confirmed by measurements of the spin 
Seebeck effect and is well described by a phenomenological theory. These findings 
reveal the unique characteristics of magnon excitations in antiferromagnets and their 
distinctive roles in spin-charge conversion in the high-frequency regime. 


Owing to their terahertz spin dynamics and absence of net magnetiza- 
tion, antiferromagnetic (AFM) materials offer unique advantages for 
ultrafast and robust spin-based nanoscale device applications””®. A 
prerequisite for practical AFM-based spintronics is the generation and 
electrical detection of pure spin currents. Cheng et al.* and Johansen 
etal.° proposed to generate coherent magnon spin currents by inducing 
uniform spin precession at the antiferromagnetic resonance (AFMR) 
with terahertz radiation. This mechanism works in the collinear AFM 
phase at arbitrary magnetic fields (even zero field) belowthe spin-flop 
transition. Ross et al.° detected d.c. voltages at the AFMR in MnF,, but 
concluded that the main d.c. voltage was from the microwave rectifi- 
cation effect or heating-related thermoelectric electromotive force 
(EMF). Clear AFM spin pumping has yet to be experimentally estab- 
lished. Here we demonstrate the generation and simultaneous electri- 
cal detection of pure spin currents in a uniaxial AFM material, Cr,0,, 
The former is accomplished by driving the AFM spin precession into 
resonance using linearly polarized sub-terahertz radiation. Through 
the inverse spin Hall effect (ISHE), the resonantly generated spin cur- 
rent is converted into a d.c. voltage. 

Cr,O, is a uniaxial AFM insulator with the easy axis along the c axis 
of the hexagonal lattice” (Extended Data Fig. 1). Because of its relatively 


simple spin structure and accessible AFMR frequency (about 0.165 THz 
at OK) and spin-flop field (6.0 T at 0 K)'*"’, Cr,0, is chosen for this study. 
As shown in Fig. 1a, 0.240-THz continuous microwaves are generated 
by asolid-state source and propagate into a corrugated waveguide at 
the centre of a12.5-T superconducting magnet. The Cr,O, (1010) slab 
is mounted ona piece of sapphire secured ona Teflon stage located at 
the exit of the waveguide. Figure 1b depicts the sample structure and 
measurement geometry. The sample is oriented with the c axis parallel 
to both the d.c. magnetic field, Ho, and the microwave propagation 
direction, k. The microwave magnetic-field component his kept inthe 
sample plane and perpendicular to the c axis of Cr,O;. Similarly to con- 
ventional ferromagnetic spin pumping, the AFM spins in Cr,O, are 
driven into resonance ata fixed, but much higher, microwave frequency 
by sweeping the d.c. magnetic field. A pure spin currentis then injected 
into the adjacent heavy-metal layer, which in turn produces a charge 
current in Pt, Ta or the Pt-Ta hybrid channel (Methods) through the 
ISHE and results in an open-circuit d.c. voltage, Vic. Microwaves 
reflected by the sample are detected by a superheterodyne receiver 
and recorded as electron magnetic resonance (EMR) signals (Meth- 
ods)°", Both EMR and electrical voltage signals are simultaneously 
measured using the standard lock-in technique. 
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Technology, University of California, Santa Barbara, CA, USA. “Department of Electrical and Computer Engineering, University of California, Riverside, CA, USA. *e-mail: jing.shi@ucr.edu 


70 | Nature | Vol578 | 6 February 2020 


a Solid-state source 
| Bums 
Mier <—— XK Wire-grid 
EMR spectrometer C) : polarizer 
ae ee ~7--% EMRsignal |v 
Mixer 


ft 1 
oO PINH x8 
_/LO system ' Corrugated 


Superheterodyne wav eo 


detection system 


12.5-T magnet 


1 
\ 
s 


0.32 


0.24 


0.16 


Frequency (THz) 


0.08 


Hf / i H 4/ wt 
°V 
f 
| Opn 4 tou 
m, / : m, / 
i — i) — > 


‘ Ke 


Fig.1| AFM spin dynamics and pure spin current in an AFM/heavy-metal 
heterostructure. a, Continuous-wave (CW) EMR system (see Methods). A 
0.240-THz CWis generated by asolid-state source and polarized by a wire-grid 
polarizer before entering the waveguide. The sample is loaded intoa 
continuous-flow cryostat mounted in the room-temperature bore of the 
magnet. CWEMR measurements are carried out using source-intensity 
modulation with a frequency of 13.037 Hz. The reflected CWs are measured by 
asuperheterodyne detection system using a local oscillator (LO) anda 
Schottky subharmonic mixer to mix the 0.240-THz signal to 10 GHz. Then, a 
home-built intermediate-frequency (IF) stage amplifies and mixes this signal 


Figure 1c summarizes the magnetic resonance frequency as a function 
of H, when the magnetic field is applied along the c axis of Cr,0,. Below 
the spin-flop field of 6.0 T, there are two distinct branches correspond- 
ing tothe two eigenmodes of the AFM spin-wave excitations or magnons, 
namely, right-hand (RH) and left-hand (LH) spin precessions with oppo- 
site chiralities*"*". At Hy = O, these two modes are degenerate at w,,/ 
(21t) = 0.165 THz, as shown in Fig. 1d, e (w,, is the magnon angular fre- 
quency at zero magnetic field). This is fundamentally different from 
ferromagnetic materials, in which the sole magnon modeis RH. Because 
these two modes carry equal but opposite angular momenta, +h (A, 
reduced Planck constant; ref.“), the net spin angular momentumis zero 
ifthey are equally populated. When H,is applied along the c axis of Cr,O,, 
the degeneracy between RH and LH modes is lifted, and the frequencies 
of thetwo branchesare given by” w/y = || 2HpHy + (Hpx/2)” +H9(1- a/2), 
where y= 28 GHz T ‘is the gyromagnetic ratio, H, and H, are effective 
fields of the inter-sublattice exchange interaction and the easy-axis ani- 
sotropy, a= y,/x, isthe ratio of the magnetic susceptibilities in the paral- 
leland perpendicular directions, and the + (—) signrefers to the RH (LH) 
mode. At low temperatures, a ~ 0 and w/y = ,|2HH, + Ho, which is rep- 
resented by the two straight lines in Fig. 1c. In our experiments, the 
microwaves are linearly polarized, and the frequency is held at 0.240 THz 
(horizontal dashed line) while Hy is swept; therefore, only the RH mode 
can be excited when the upper branch intercepts the horizontal dashed 
line. For Cr,O,, the low-temperature resonance field of the RH mode is 
estimated to be about 2.7 T, whichis confirmed by our EMR experiment 
(Supplementary Information Note 1). 


downto baseband. The resulting signal is measured in quadrature witha pair of 
lock-in amplifiers. PIN, p-i-n switch. b, Sample structure and spin-current 
injection. Two sublattice magnetic moments, m, and m,, are excited into 
resonance, and the spin currentJ, with spin polarization o generates the charge 
currentJ, inthe heavy-metal (HM) layer. c, Magnetic-resonance frequency asa 
function of the magnetic field Ho (Uo, Magnetic permeability constant) 
applied along thec axis of Cr,O, at OK. The labels d,e and fcorrespond tothe 
panels at right. d-f, The eigenmodes of the RH AFMR (d), the LH AFMR (e), and 
the QFMR (f), as labelled inc. Og, and 6,,,are the spin polarizations associated 
with RH and LH chirality, respectively. 


As the H, strength reaches the spin-flop field, spins in both sublat- 
tices switch abruptly to align nearly perpendicular to H, with a small 
inclination”. We observe an EMR feature at the spin-flop transition 
(Supplementary Information Notel). Anew resonance mode emerges 
above the spin-flop field, as depicted in Fig. 1f. In this mode, the total 
magnetic moment m=m, + m, precesses around H, with RH chirality”. 
This is essentially equivalent to the ferromagnetic resonance mode in 
ferromagnets. We call it quasi-ferromagnetic resonance (QFMR) mode 
to distinguish it from the ferromagnetic resonance mode of the fully 
spin-aligned state, which could only be accessed at extremely high 
magnetic fields. The QFMR frequency is given by” w/y = .{/H2- 2HeHy. 
For w/(21t) = 0.240 THz, the QFMR is estimated to occur at ~10.5 Tin 
Cr,O, (see Supplementary Information Note! for the EMR signal at the 
QFMR). By sweeping H, up to 12T, we can excite both the RH AFMR and 
QFMR modes in Cr,O,. 

At these resonances, uniformly precessing spins in Cr,0,; forma 
k=0 magnonreservoir. Similar to ferromagnetic spin pumping, when 
the magnon reservoir is in contact with a heavy-metal layer suchas Pt, 
a pure spin current flows across the interface via magnon-electron 
interactions and is consequently converted to an open-circuit d.c. 
voltage in Pt due to the ISHE. We observed electrical voltage signals 
at both resonance fields identified by EMR signals. To distinguish the 
pure-spin-current effect from other spurious effects, we use Pt and Taas 
two independent detection channels (Extended Data Fig. 2). Because of 
the opposite sign in their spin Hall angles’ °, the same pure spin current 
must produce voltage signals with opposite polarities in the Pt and Ta 
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Fig. 2 | ISHE signals at the AFMR and the QFMR. a,c, Electrical voltages from 
the Pt channel (a) and the Tachannel (c) at the AFMR at 5K. b,d, Voltages from 
the Pt channel (b) and the Ta channel (d) at the QFMRat 5K.e, f, Voltages 
from the Pt-Ta hybrid channel at the AFMR (e) and the QFMR (f) for both 


positive and negative fields at 10 K. The ISHE voltage is dominated by the Ta 
channel at lowtemperatures. 


channels. Onthe other hand, any spurious electrical signal generated 
by microwave rectification or heating-related thermoelectric EMF may 
maintain the same polarity in Pt and Ta. Indeed, sharp voltage features 
are unmistakably resolved at 2.7 T in two independent Pt and Ta chan- 
nels, as shown in Fig. 2a, c. The resonance field corresponds well to the 
AFMR field identified by EMR. More importantly, the Pt and Ta chan- 
nels register opposite AFMR voltages at 5 K. At10.5 T, where the QFMR 
occurs, the same opposite polarity is observed, as shown in Fig. 2b, d. 
The opposite voltage polarity between the Pt and Ta channels at both 
the AFMR and the QFMR is a defining characteristic of the resonantly 
generated pure spin current. Because the ISHE voltage is proportional 
to the resistivity of the detecting channel, providing the same spin- 
charge conversion efficiency, and Ta is much more resistive (by more 
than a factor of 10) than Pt, we expect a larger voltage response in Ta 
(Supplementary Information Note II). 

After confirming the opposite voltage polarities in the independent 
Ta-only and Pt-only channels, we wire-bond the neighbouring Pt and 
Ta strips in series (Extended Data Fig. 2c) to form a single long hybrid 
channel. All Ptand Tastrips onthe chip are connected to further enhance 
the total voltage signal output. In the following experiments, unless oth- 
erwise specified, we adopt this hybrid detector geometry for electrical 
detection. To corroborate the pure-spin-current nature of the d.c. volt- 
age signals, we reverse the direction of Ho, and therefore the spin polari- 
zation of the pumped spin current. We observe a completely inverted 
voltage signal on the negative-field side (Fig. 2e). The same complete 
voltage inversion is observed for the QFMR despite the complicated line 
shape (shown in Fig. 2f). We note that heating-related thermoelectric 
EMF cannot produce voltage sign reversal when the magnetic field is 
reversed. Hence, the observation of the two sign reversals leads us to 
unambiguously conclude that the d.c. voltage in heavy-metal detectors 
stems exclusively from pure spin current generated by resonant mag- 
nonexcitations. Furthermore, we confirm that the ISHE voltages have 
alinear dependence on the microwave power (Extended Data Fig. 3). 
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Fig. 3 | Temperature dependence of ISHE signals. a, ISHE voltage signal at the 
AFMR.b, ISHE voltage signal at the QFMR. c, Temperature dependence of the 
ISHE peak height AV), (as indicated in a) at the AFMR (symbols) and the best fit 
(line) using equation (S16) in Supplementary Information. d, Temperature 
dependence of the ISHE peak height AV,,,,; (as indicated in b) and of S, 
(symbols): the magnitude of the main symmetric Lorentzian function 

(see Supplementary Information section III). The red solid line is the best fit 
using equation (S17) in Supplementary Information. The error bars represent 
the range of the off-resonance voltage signals. Error bars inc and d are smaller 
than thesymbols. 


Amore rigorous examination of the low-temperature behaviours of 
the AFMR and QFMR shown in Fig. 2 reveals something counterintuitive. 
With the same detecting heavy metal, the ISHE voltage feature at the 
AFMR and the main feature at the QFMR have opposite signs, which 
contradicts the simple picture of coherent spin pumping. As schemati- 
cally illustrated in Fig.1d, f, the RH AFMR and the QFMR eigenmodes 
exhibit the same spin polarization; hence, the resulting ISHE voltages 
should also have the same polarity for a fixed heavy metal. The appar- 
ent contradiction implies that coherent spin pumping is not the sole 
mechanism. To better understand the origin of the sign discrepancy, 
we perform ISHE voltage measurements over a wide range of tempera- 
tures. Figure 3a shows a plot of the ISHE voltage at the AFMR for positive 
magnetic fields from 5 K to 120 K. The voltage signal is negative at low 
temperatures and becomes smaller as the temperature is raised. It 
crosses zero at about 45 K and stays positive at higher temperatures, 
until it finally diminishes at around 120 K. A similar sign change is also 
observed for negative magnetic fields (Extended Data Fig. 4). By con- 
trast, the main ISHE peak at the QFMR (Fig. 3b) always stays positive, 
and decreases monotonically with increasing temperature until it 
finally vanishes above 100 K. We also confirmed the sign-change pat- 
tern of the ISHE signals by comparing the Pt-only and Ta-only channels 
(Extended Data Fig. 5). 

After carefully analysing the line shape of the d.c. voltage signals at 
the QFMRatall temperatures (Supplementary Information Note III), 
we plotted the magnitudes of both the AFMR voltage and the main 
QFMR peak voltage (Fig. 3c, d). The contrast ends at low tempera- 
tures. Clearly, the ISHE voltage at both resonances disappears far 
below the Néel temperature of the Cr,O, crystal (307 K)'*"’. We note 
that the EMR signal at the AFMR is still observable up to 288 K (see 
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Fig. 4| SSE from incoherent AFM magnons. a, b, SSE measurements in 
Cr,0,(1010)/Ta (a) and Cr,0,(1010)/Pt (b) heterostructures. The vertical 
temperature gradient V7 is generated by an on-chip heater. The magnetic field 
His applied in plane and along the c axis. c, d, Field dependence of the SSE 
signal, normalized to the heating power, at 4.6 K, 100K, 200 Kand 300 Kin 
Cr,0,/Ta (c) and Cr,0,/Pt (d) heterostructures. e, f, femperature dependence 
of the SSE signal, normalized to the heating power, under magnetic fields of 
2.7T,5.0T, 10.5 Tand12T in Cr,0,/Ta (e) and Cr,0,/Pt (f) heterostructures. 


Supplementary Fig. 4), indicating that microwave absorption remains 
active at least up to room temperature. Because the ISHE voltage 
depends onthe efficiency of the spin-charge conversion, we believe 
that the quality of the interface may be responsible for the disappear- 
ance of the ISHE signals at a lower temperature than the ordering 
temperature of Cr,03. 

The stark contrast between the temperature dependence of the 
AFMR- and QFMR-induced spin currents indicates that coherent spin 
pumping alone is inadequate to explain these behaviours. The rea- 
sons include: (1) no sign change of the dependence of the spin Hall 
angle of Pt or Ta on the temperature has ever been reported’®; (2) the 
absence of sign change for the QFMR further confirms point (1); (3) 
the spin-polarization direction of the resonant mode is fixed and can- 
not change with temperature. We propose the following mechanism 
to explain this unusual temperature dependence. As illustrated in 
Fig. 1c, whereas the coherently driven AFMR selectively excites the 
RH mode, thermal excitations of incoherent magnons prefer the LH 
mode, which has a lower energy. Asa result, thermally driven LH mag- 
nons compete with coherent RH magnons. When the contribution 
of the former exceeds that of the latter, it causes a sign change of the 
total ISHE voltage. Therefore, the sign change strongly suggests rapid 
thermalization of coherent magnons into incoherent magnons. This 
process results in an increased effective magnon temperature, which 
inturn raises the lattice temperature via magnon-phonon scattering. 
In fact, we observed a temperature rise at the AFMR by monitoring 
the resistance of the Pt-Ta hybrid channel, which serves as a sensitive 


thermometer (Supplementary Information Note IV). A similar mecha- 
nism has recently been proposed for ferromagnetic spin pumping, 
which can result in resonance-induced heating”>™*. 

The increased incoherent magnon population at the resonances 
must be accompanied by an additional spin current flowing across 
the metal-Cr,O, interface, and consequently an additional ISHE volt- 
age of opposite polarity. This process is essentially the same as the 
spin Seebeck effect (SSE), in which the incoherent magnon diffusion is 
driven by atemperature gradient. To better understand the incoherent 
magnoncontribution, we performed independent SSE measurements 
inthe same Cr,O,/Ta and Cr,O,/Pt heterostructures, where coherent 
magnons were completely eliminated. As illustrated in Fig. 4a, b, we 
measured the SSE-induced ISHE voltages by sweeping the magnetic 
field along the c axis of the Cr,O, crystal under a vertical temperature 
gradient generated by a heater on top of the crystal. Figure 4c, d dis- 
plays the field dependence of the SSE signals in Cr,O,/Ta and Cr,O,/Pt. 
First, there is alarge SSE signal at low fields. At 6 T—-that is, the spin-flop 
field—the SSE voltage undergoes an abrupt jump and switches sign. The 
sign switchis exactly what is expected from the magnon energy diagram 
shown in Fig. 1c. Below the spin-flop transition, the LH-magnon branch 
has a lower energy, and thus the total thermal magnon population 
is dominated by LH magnons at low temperatures. Above the spin- 
flop transition, however, the canted state supports only RH magnons; 
therefore, the SSE must change sign across the spin-flop transition. 
The same behaviour was also observed in heterostructures contain- 
ing epitaxial Cr,O; thin films (Extended Data Fig. 6). The SSE signal 
in Cr,O,/Pt is simply inverted compared to that in Cr,O,/Ta for the 
same reason as in the resonance data. Our SSE signal is quite different 
from what was reported by Seki et al.”, which might be caused by the 
different Cr,0,/heavy-metal interface properties (as demonstrated in 
Extended Data Fig. 7). The temperature dependences of both samples 
are summarized in Fig. 4e, f for magnetic fields below and above the 
spin-flop transition and temperatures of up to 310 K. It is interesting 
to note that the incoherent magnon contribution to the ISHE volt- 
age in Cr,O, alone does not cause any sign change, which is different 
from the behaviours of some ferrimagnets”°”’ (see discussion in Sup- 
plementary Information Note V). As the temperature is decreased, 
the magnitude of the SSE voltage below the spin-flop field increases 
precipitously. This trend is the same as that of the AFMRISHE voltage. 
Both can be explained by a decreased equilibrium population of RH 
magnons. At the AFMR, as the temperature is lowered, the increased 
LH-magnon contribution balances out that of the coherently excited 
RH magnonsat ~45 K. In fact, as the temperature is decreased further, 
the net LH-magnon population should reach the maximum, and an SSE 
voltage peak emerges”’. The peak occurs approximately at temperature 
T, when kg 7, ~ h@,,, where k; is the Boltzmann constant. This overall SSE 
characteristic has been observed in other uniaxial AFM materials, such 
as MnF, and FeF,””*°. Compared to these two materials, Cr,O, has alower 
@; hence, the SSE peak should occur at an even lower 7,. We note that 
the SSE peak was not captured in previous experiments because 7, was 
outside their temperature range”, but it appears at -2.3 K in our experi- 
ment, as shown in Fig. 4e, f. In principle, the AFMRISHE voltage would 
also show a peak if the temperature range extended below 7, in our 
resonance experiments. The same proposed coherent-to-incoherent 
magnon thermalization mechanism should also be applicable for the 
QFMR, except that both magnons are RH. In the case of the QFMR, the 
incoherent magnon contribution should be compared with the SSE 
voltage above the spin-flop transition, as is also shown in Fig. 4e, f. 

InSupplementary Information Note VI, we describe the temperature 
dependence of the ISHE voltages at both the AFMR and the QFMR onthe 
basis of a proposed coherent-to-incoherent thermalization mechanism 
that converts energy from coherent Néel order precession into phase- 
random thermal magnons. We capture the thermalization process 
phenomenologically by atemperature dependent parameter n(7), as 
shown in equations (S8) and (S9) in Supplementary Information. We 
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further assume that 7(7) scales as T’, with the exponent a obtained 
from fitting the experimental data. The higher the temperature is, the 
faster incoherent magnons are generated by coherent magnons, which 
leads to a reduction of coherent spin pumping. On the other hand, 
the scattering between thermal magnons and phonons also becomes 
stronger at higher temperatures, which destroys incoherent magnons 
and eventually transfers energy into phonons. Meanwhile, the net 
magnon spin-current polarization, characterized by (7) in equation 
(S15) in Supplementary Information, also decreases with increasing 
T (Extended Data Fig. 8). The combination of these two mechanisms 
results in a substantial reduction of incoherent spin current at higher 
Tvalues. Whereas both coherent and incoherent contributions decay 
with increasing 7, the incoherent contribution varies much faster than 
the coherent contribution, which explains the crossover inthe AFMR 
ISHE voltage at about 45 K in our experiments. As shown in Fig. 3c, d, 
our theory fits the experimental data very well. 

In the case of the QFMR, coherent and incoherent magnons have 
the same chirality, so that they generate spin currents with the same 
polarization; thus, there should be no sign change at any temperature. 
Similarly to the AFMR case, both contributions decrease with increasing 
T; thus, the total spin current simply exhibits amonotonic decay with 7, 
which agrees with the experimental data in Fig. 3d. We also notice that 
the situation of the QFMR is similar to what happens in ferromagnetic 
spin pumping, where the distinction between coherent and incoherent 
contributions is quite subtle (even controversial)**™. In this regard, we 
believe that AFM materials provide a unique playground for studying 
the interplay between coherent and incoherent magnons. 

In summary, we have demonstrated the generation and electrical 
detection of pure spin currents pumped by resonances in uniaxial AFM 
Cr,0, using sub-terahertz radiation, and have unequivocally confirmed 
the ISHE nature of the voltage signals. The intriguing temperature 
dependences of the ISHE voltages at both the AFMR and the QFMR 
suggest that coherent and incoherent magnons contribute to the spin 
current, with the latter dominating at low temperatures. Our experi- 
mental findings set the stage for further exploring spin currents inthe 
emerging field of AFM spintronics. 
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Methods 


Characterization of Cr,0, (1010) crystal 

Cr,0, is a typical uniaxial antiferromagnet with a hexagonal crystal 
structure and with its magnetic easy axis along the c axis (Extended 
Data Fig. 1a). A Cr,O, (1010) single-crystal slab with an in-plane easy 
axis (c axis) and dimensions of 5 x 5 x 1 mm? was purchased from Sur- 
faceNet GmbH. X-ray diffraction patterns are acquired with an Empy- 
rean X-ray diffractometer with a Cu Ka radiation source. The clear 
(3030) Bragg peak at a diffraction angle of 20 = 66° (Extended Data 
Fig. 1b) confirms the (1010) orientation of the crystal. The absence of 
other peaks over a broad range of 20 (inset of Extended Data Fig. 1b) 
suggests the phase purity of the crystal. Tapping-mode atomic force 
microscopy (Bruker Dimension, Model 5000) characterization 
(Extended Data Fig. 1c) is performed on the polished surface of the 
Cr,0,(1010)crystal. The root-mean-square roughness over al pm x 1pm 
area is less than 0.1nm. A clean and smooth interface is crucial for 
observing the spin-pumping signal in our experiments. 


EMR measurement 

Continuous-wave (CW) EMR measurements are performed onahome- 
built CW EMR spectrometer at the Institute for Terahertz Science and 
Technology of the University of California, Santa Barbara. A solid-state 
source, which multiplies the frequency of a 15-GHz synthesizer by a 
factor of 16 to achieve an output frequency of 0.240 THz, produces a 
CW power of 55 mW (Virginia Diodes). The incident microwave power 
is controlled by voltage-controlled attenuation of the source and bya 
pair of wire-grid polarizers. The sample is secured ona sapphire piece 
and mounted ona Teflon stage located at the exit of an overmoded 
waveguide (Thomas Keating). The reflected EMR signal is measured in 
induction mode, where superheterodyne detection is achieved using 
a Schottky subharmonic mixer (Virginia Diodes) to mix the 0.240-THz 
signal down to 10 GHz. A home-made intermediate-frequency stage 
then amplifies and mixes this 10-GHz signal down to baseband. We 
modulate the CW source intensity with a frequency of 13.037 Hz, and 
the reflected signal is measured in quadrature with a pair of lock-in 
amplifiers (Stanford Research System SR830). 


Measurement geometry of AFM spin pumping 

In our AFM spin-pumping experiments, the sample is secured ona 
sapphire piece and mounted ona Teflon stage at the exit of the wave- 
guide. Thecaxis of the Cr,O, (1010) crystal, the propagation direction 
of 0.240-THz microwaves and the external magnetic fields are all along 
the same direction. 

For electrical detection, we pattern eight Pt strips and eight Ta strips 
alternately on the polished surface of the Cr,O; (1010) crystal perpen- 
dicular to the c axis of Cr,O,. All parallel strips have the same lateral 
dimensions of 50 pm x 3.5 mm and a thickness of 5 nm. Each of the 
independent strips produces an EMF like a battery owing to the ISHE 
during the spin-pumping experiments, but the ISHE voltages from the 
Pt and Ta strips have opposite polarities owing to their opposite spin 
Hall angles. To maximize the ISHE signals, we connect these individual 


‘batteries’ in series—that is, positive terminal to negative terminal—to 
produce a large sum signal. As depicted in Extended Data Fig. 2, we 
have three detection geometries to probe the ISHE signals, from Pt- 
only, Ta-only and Pt-Ta hybrid channels. By using multiple strips in 
each geometry, the ISHE voltage can be effectively multiplied. In addi- 
tion, asignal preamplifier (SR560) is used to amplify the voltage signals, 
and the gain is set to 5,000 in all voltage measurements. 


SSE measurements 

To form the Cr,0,/Pt(Ta) heterostructures for the SSE measurements, 
5-nm-thick Pt(Ta) is directly deposited on top of the Cr,O; (1010) crys- 
tal by magnetron sputtering and patterned into a Hall bar with dimen- 
sions of 200 um x 2,740 um perpendicular to the c axis. Then, an 
80-nm-thick AI,O, insulating layer is deposited by atomic layer depo- 
sition, followed by Cr(45 nm)/Au(5 nm) films covering the Hall bar 
channel area as a heater. In the SSE experiment, an a.c. current is 
applied to the Cr/Au heater to generate a vertical temperature gradi- 
ent across the interface, and the double-frequency voltage response 
along the Pt(Ta) Hall bar channel is recorded as the spin Seebeck sig- 
nal V;.- using the standard lock-in technique. The frequency of the a.c. 
current is set at 13 Hz. An external magnetic field is applied in plane 
along the easy axis of the Cr,O, (that is, [0001]) or the c axis during 
the SSE measurements. We normalize the SSE voltages by the heating 
power to draw meaningful comparisons among different measurement 
conditions. 


Data availability 


The datathat support the plots within this paper and other findings of 
this study are available from the corresponding author upon reason- 
able request. 
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indicate the Cr atoms and the spins associated with them, respectively. The Cr,0, (1010) single crystal. 
coloured plane is the (1010) plane. b, X-ray diffraction results of the Cr,O, (1010) 


Extended Data Fig. 2| Measurement geometry of sub-terahertz spin- ends of the strips. Hy is an external magnetic field; hand k are the magnetic 
pumping experiments. a, Pt channel only: only Pt strips are wire-bondedin component and wavevector of the 0.240-THz microwaves, respectively. Visueis 
series. b, Tachannel only: only Ta strips are wire-bonded in series. c, Pt-Ta the open-circuit voltage. The white arrows denote the c axis of Cr,O, (1010). 
hybrid channel. In a-c, black lines indicate conductive wires that connect the 
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Extended Data Fig. 3| Linear microwave power dependence of ISHE signals 


at 10K.a, Field dependence of the ISHE signal at the AFMR for different 
microwave powers. b, Field dependence of the ISHE signal at the QFMR for 


different microwave powers. c, Microwave power dependence of the ISHE 
signal magnitude at both the AFMR and the QFMR. AV; is defined ina and b. 
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Extended Data Fig. 4 | ISHE signal at the AFMR under negative external 


b, Temperature dependence of the magnitude of the ISHE signal under positive 
magnetic fields Ho. a, ISHE signal as a function of the negative magnetic field and negative magnetic fields. Inset, ISHE signal above 30 K. 
H, at different temperatures. Hy is along the easy axis of the Cr,O, (1010) crystal. 
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smoothed ISHE signals. At the AFMR, the ISHE signals of the Pt and Ta channels QFMR mode. At and above 60 K, the QFMR voltage signal shows a single 
at 60 K have opposite signs to that at 5 K (Fig. 2a for Pt and Fig. 2c for Ta). By Lorentzian peak witha slightly larger linewidth than that of the AFMR peak. 
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Extended Data Fig. 6 | SSE signal at 9.9 K in Cr,0; (100 nm)/Pt and 
Cr,0,(100 nm)/Ta heterostructures. a, Cr,O, (100 nm)/Pt heterostructure. 


b, Cr,0;(100 nm)/Ta heterostructure. The Cr,O, is a (1120)-oriented epitaxial 
thin film deposited onan Al,O; (1120) substrate. The magnetic field is applied 
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along the c axis of Cr,0;. The SSE signal changes sign across the spin-flop 
transition, which further confirms that LH magnons (dominating the SSE below 


the spin-flop transition) and RH magnons (dominating the SSE above the spin- 
flop transition) carry opposite angular momenta. 
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Extended Data Fig. 7 | SSE signal in bulk Cr,0,(1010)/Pt. a, b, Results are 
shown for bulk Cr,0,(1010)/Pt with untreated (a) and etched (b) interfaces. For 
the untreated sample, we anneal the crystal in air at 600 °C for 2 husing a tube 
furnace before the deposition of the Pt layer. For the etched sample, we first 
bombard the surface of the Cr,O, crystal with argon ions using inductively 
coupled plasma, and then anneal it in air at 600 °C for 2 husing a tube furnace 
before we deposit the Pt layer. The etching process does not affect the sign of 
the SSE signal above the spin-flop transition; however, it changes its sign below 
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the spin-flop transition. A possible reason is that the etching process may 
produce some uncompensated magnetic momentsat the interface owing to 
the different sputtering yields of Cr and O atoms, and these uncompensated 
magnetic momentsalso contribute to the SSE signal by modifying the 
interfacial spin-mixing conductance or directly generating additional spin 
current. In addition, the etched sample generates a much lower SSE signal than 
the untreated sample under the same measurement conditions. 
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modelling and numerical results of &(7) .a, Schematic device geometry used numerical calculations based on equation (S13) in Supplementary Information. 
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(equation (S9) in Supplementary Information). The bilayer structure is respectively. 
represented by an AFM layer and anon-magnetic (NM) metal layer of thickness 
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Complex-oxide materials exhibit a vast range of functional properties desirable for 
next-generation electronic, spintronic, magnetoelectric, neuromorphic, and energy 
conversion storage devices’ *. Their physical functionalities can be coupled by 
stacking layers of such materials to create heterostructures and can be further 
boosted by applying strain*’. The predominant method for heterogeneous 
integration and application of strain has been through heteroepitaxy, which 
drastically limits the possible material combinations and the ability to integrate 
complex oxides with mature semiconductor technologies. Moreover, key physical 
properties of complex-oxide thin films, such as piezoelectricity and 
magnetostriction, are severely reduced by the substrate clamping effect. Here we 
demonstrate a universal mechanical exfoliation method of producing freestanding 
single-crystalline membranes made from a wide range of complex-oxide materials 
including perovskite, spinel and garnet crystal structures with varying 
crystallographic orientations. In addition, we create artificial heterostructures and 
hybridize their physical properties by directly stacking such freestanding membranes 
with different crystal structures and orientations, which is not possible using 
conventional methods. Our results establish a platform for stacking and coupling 
three-dimensional structures, akin to two-dimensional material-based 
heterostructures, for enhancing device functionalities®’. 


Traditionally, heterogeneous coupling and control of strain for crystal- 
line films are carried out through heteroepitaxy on lattice-mismatched 
substrates". Epitaxial methods, however, have fundamental limi- 
tations that prevent the unrestricted manipulation, integration and 
utilization of these materials. First, heteroepitaxy occurs only for dif- 
ferent materials that havea lattice constant or crystal structure within 
acertain threshold. Thus, heterostructuring via epitaxy is allowed 
only for relatively limited material systems. Moreover, the degree of 
strain that can be applied to an epitaxial layer is fixed by pseudomor- 
phic epitaxial conditions. Second, the epitaxial filmis clamped by the 
substrate, constraining several important properties. For example, 
the piezoelectric and magnetostrictive responses are dampened by 
approximately an order of magnitude by the substrate clamping effect, 
reducing their sensitivity and maximum response”. Third, epitaxial 
growth typically requires elevated temperatures, often preventing 
the epitaxial integration of materials that are stable in very different 


environments or are thermodynamically unstable when in contact with 
each other; such instability typically precludes the epitaxial integration 
of complex oxides with mainstream semiconductor materials. Thus, 
it has been extremely challenging to form heterostructures between 
materials with large lattice mismatch or between materials chosen 
solely for the desired properties they would bring to an artificial het- 
erostructure, and it is even more challenging to unclamp epitaxial 
films from the substrate. Freestanding heterostructures without any 
limitations in crystal structures are often achieved in two-dimensional 
(2D) material systems by stacking ultrathin layers (a few atoms thick) 
of 2D materials”, and the concepts of layer transfer of single materi- 
als or of various individual devices composed of nanomaterials onto 
foreign substrates have been demonstrated in the past*’*. However, 
artificial heterostructuring of multiple single-crystalline membranes 
and robust physical coupling, experimentally demonstrated here, 
have been elusive. Although chemical lift-off of oxide materials has 
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Fig. 1| Epitaxial lift-off of complex-oxide membranes on graphene-coated 
substrates. Cross-sectional TEM of STO film grown ona graphene-coated STO 
substrate without (a) and with (b) a graphene protection layer. White boxes 
indicate the fast Fourier transform areas, which confirm the crystallinity 

of each epitaxial region in comparison to the substrate. c, EBSD of the 
exfoliated STO membrane, confirming the single-crystalline out-of-plane 
(001) orientation. 2ML, two monolayers. d, Asymmetric @scan of the STO 
membrane measured by high-resolution X-ray diffraction. EBSD and 


been reported’, this method is applicable only to a limited range of 
material systems owing to the lattice mismatch and etch selectivity con- 
straints between the epitaxial layer, sacrificial layer and the substrate. 
Additionally, slow release rate is generally a well known shortcoming 
of chemical lift-off for larger substrates. 

Here we prepare artificial complex-oxide heterostructure stacks 
using mechanical lift-offtechniques, in which the epitaxial oxide films 
are instantly separated from weakened epitaxial interfaces to form free- 
standing single-crystalline membranes. These techniques can, in the- 
ory, be universally applied to prepare freestanding membranes across 
abroad range of crystal structures (for example, perovskite, spinel and 
garnet) with the potential capability of reusing the host oxide substrate. 
We now demonstrate freestanding membranes made from several 
important oxide structures including archetypal perovskite SrTiO, 
(STO), perovskite BaTiO; (BTO), spinel CoFe,0, (CFO), garnet Y;Fe;O,, 
(YIG) and a perovskite of complex composition Pb(Mg,/,Nb,/3)03- 
PbTiO, (PMN-PT). Single-crystalline STO, BTO, CFO and YIG were 
remote-epitaxially (that is, by epitaxial growth of thin films seeded by 
the underlying substrate through a few layers of graphene)” grown on 
graphene-coated STO, MgAl,0, (MAO) and Gd;Ga;0,, (GGG) substrates, 
respectively, followed by mechanical exfoliation. In addition, we dem- 
onstrate single-crystalline freestanding membranes of PMN-PT that are 
grown via sputtering, which damages graphene”. This was enabled by 
discovering that SrRuO, (SRO) can provide a weak interface with PMN- 
PT, allowing PMN-PT films to be mechanically released precisely at the 
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asymmetric @ scans of CFO (100) (e, f) and YIG (111) (g, h) membranes, showing 
single-crystallinity with uniform out-of-plane orientation and no in-plane 
rotation. i-k, Photographs of the exfoliated oxide membranes (100 nm of 

STO (i), CFO (j) and YIG (k)) supported on thermal release tape. The strain 
applied tothe film in the figure with a bending radius of 1.5 cmis not sufficient 
to crack the films***’. The strain increases with thickness and for the 100-nm- 
thick films with 1.5 cm bending radius, the strain is around 0.1%, whichis much 
smaller than the critical cracking strain of about 1%. 


PMN-PT/SRO interface without use of graphene. From these freestand- 
ing membranes, we fabricated various heterostructures with the goal of 
coupling the unique properties of the component materials by stacking 
them directly (that is, there is nothing between the layers). Enhanced 
magnetoelectric coupling has been observed by stacking magnetostric- 
tive CFO and piezoelectric PMN-PT, because their physical properties 
can be greatly enhanced in freestanding form by being declamped from 
the substrate. We also demonstrate magnetostatic and magnetoelastic 
coupling ina CFO/YIG membrane heterostructure. More importantly, 
we verified the electrical coupling of graphene sandwiched between 
freestanding CFO and YIG membranes by tracing the Fermi level shift 
with respect to the Dirac point of graphene. Our findings advance oxide 
research by allowing unrestricted integration of single-crystalline, dis- 
similar, complex-oxide membranes into elaborate heterostructures 
unattainable by epitaxy and chemical lift-off methods, which opens up 
opportunities to produce unprecedented three-dimensional (3D) het- 
erostructures formed from various freestanding 2D or 3D membranes. 

We first studied the growth dynamics of STO films on graphene- 
coated STO (001) substrates (see Supplementary Information for details 
of the transfer process). Our density functional theory calculation sug- 
gests that atomic potential fields can penetrate completely through 
bilayer graphene and partially through trilayer graphene (Extended 
Data Fig. 1), thus allowing successful remote epitaxy up to two monolay- 
ers of graphene interlayers. Our pulsed-laser-deposition experimental 
results precisely followed our prediction: single-crystalline STO films 
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Fig. 2| Precise epitaxial interface separation of PMN-PT onaSRO/STO 
substrate. a, Photograph of as-grown PMN-PT onaSRO/STO substrate (left), 
SRO/STO substrate after PMN-PT exfoliation (bottom right), and exfoliated 
PMN-PT membrane (top right). The scale bar indicates 2mm. b, HAADF-STEM 
image of the PMN-PT/SRO/STO interface, with high-stress Ni (100 nm) 
deposited ontop.c, AFM of the SRO/STO substrate surface after exfoliating the 
PMN-PT. The scale bar indicates 5 pm. d, Cross-sectional TEM (left) of the 
exfoliated PMN-PT using a Ti (30 nm)/Ni (3 pm) metal stressor layer, verifying 


were successfully grown through bilayer graphene interlayers prov- 
ing successful seeding from the STO substrates through graphene* 
(see Methods for epitaxy conditions). In situ high-pressure reflection 
high-energy electron diffraction (RHEED) during growth also showed 
clear intensity oscillations and crystallinity of the film during growth 
(Extended Data Fig. 2). In contrast to epitaxy of semiconductor com- 
pounds from groups III-V and III-nitrides on graphene where the 
adatoms or substrate do not react with graphene, it is crucial to avoid 
oxidation of graphene for epitaxy of oxides to ensure the release of 
remote epitaxial STO”. Initially, we were unable to exfoliate the STO 
films grown in a conventional oxygen overpressure owing to the gra- 
phene being etched during the nucleation stage from the substrate as 
evidenced by the absence of graphene in cross-sectional transmission 
electron microscopy (TEM) (Fig. 1a). To protect the graphene, we depos- 
ited an ultrathin STO buffer (about 5-10 nm), whichis not grownina 
conventional oxygen overpressure, but under vacuum (<5 x10 ° torr). 
By applying this buffer, we were able to preserve the graphene (Fig. 1b), 
resulting in successful production of freestanding STO membranes 
(Extended Data Fig. 3). An electron backscatter diffraction (EBSD) map 
of the STO film (Fig. 1c) showed (001) cubic orientation over a large 
area. Additionally, azimuthal X-ray diffraction @ scanning confirmed 
in-plane single-crystallinity without any rotated domains (Fig. 1d). We 
also verified, via electron energy loss spectroscopy, that further growth 
of STO inan oxygen overpressure effectively corrects the oxygen stoi- 
chiometry of the entire STO film, even the region grown under vacuum 
(Extended Data Fig. 4). We note that because the STO substrate is also 


Ni stressor 


Height (nm) 


5 nm 


PMN-PT @ scan 


Intensity 
(counts per second) 


10° 
0 50 100 150 200 250 300 


¢ (’) 
theintegrity of the PMN-PT crystallinity after exfoliation. Needle-like contrast 
areas are caused by slight variations in composition and phase that are 
attributable to misfit strain**. Representative high-resolution TEM (middle) 
and selective-area diffraction (right) images are also shown, verifying the 
single-crystalline nature of the exfoliated PMN-PT membrane. e, f, EBSD (e) and 
asymmetric @ scan (f) of the exfoliated PMN-PT membrane, showing high- 
quality single-crystallinity and no in-plane rotation. 


asource of oxygen”, the exfoliation area yield of the sample with one 
monolayer of graphene is low compared to the exfoliation area yield 
of samples with two or more graphene layers (Supplementary Fig. 1). 
Through these findings, we concluded that two graphene layers are 
optimal to achieve the highest ratio of crystal quality to exfoliation 
yield. 

As representative cases for spinel and garnet oxides, we grew spinel 
CFO and garnet YIG on graphene-coated MAO (001) and GGG (111) sub- 
strates, respectively. Single-crystallinity of the grown film was again 
verified by EBSD and high-resolution X-ray diffraction (Fig. le-h). The 
magnetization values of freestanding CFO and YIG were within a reason- 
able range of the bulk values, indicating the good quality of freestanding 
single-crystalline membranes (Supplementary Fig. 2). We carried out 
cross-sectional transmission electron microscope (TEM) measure- 
ments on the exfoliated CFO membrane to confirm the crystallinity 
at an atomic scale. Within the single-crystalline matrix, we were able 
to observe localized polycrystalline domains (Extended Data Fig. 5). 
The case is similar for other remote epitaxial membranes (see Extended 
Data Fig. 3). These polycrystalline domains are probably caused by 
regions of non-uniform graphene thickness or organic/metal residues 
left from graphene transfer and due to the high sticking coefficient 
of oxide adatoms”. Thus, the quality of the transferred graphene on 
the substrate determines the exfoliation area and crystallinity of the 
epitaxial film. Regardless, applying a thin protection layer before grow- 
ing in an oxygen environment is effective for all materials explored 
here: all epitaxial films were successfully released from the substrate. 
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Fig. 3 | Heterogeneous integration of CFO and PMN-PT membranes for 
strain-mediated thin-film magnetoelectrically coupled heterostructure. 
a, Device schematic of the CFO/PMN-PT magnetoelectric device. The CFO has 
an area of 4x 3 mm?andathickness of 300 nm. The PMN-PT has an area of 

5x 5mm/anda thickness of 500 nm. For the clamped device, the substrate is 
STO and the bottom electrode is SRO. For the freestanding membrane device, 
the substrate is PDMS and the bottom electrode is Ti. b, Schematic of the setup 
to measure the magnetic-field-induced voltage across the PMN-PT. A small 
alternating-current magnetic field is supplied by a Helmholtz coil and a direct- 
current magnetic field is provided by an electromagnet. The magnetic-field 
direction is applied parallel to the sample. The voltage generated across the 
PMN-PT membrane is measured asa function of the alternating-current 
magnetic-field amplitude with a lock-in amplifier. DUT stands for device under 


The resulting flexible STO, CFO and YIG membranes with thicknesses 
of 100 nm are shown in Fig. li-k, supported by a flexible handling 
tape. Moreover, we verified the compatibility of this process using 
molecular-beam epitaxy (MBE), another technique often used to grow 
single-crystalline complex-oxide films, by growing BTO on graphene- 
coated STO substrates and exfoliating the BTO film. Single-crystalline 
BTO membranes were produced, similar to the results obtained by 
pulsed-laser deposition (Extended Data Fig. 6). 

We also discovered that the bilayer graphene interlayer not only 
enhances exfoliation yield, but also minimizes damage to the substrate 
upon peeling, thus promoting reusability of the substrates. We have 
successfully obtained multiple CFO freestanding membranes by reus- 
ing asingle graphene-coated MAO substrate three times. The magnetic 
hysteresis measured from freestanding CFO using vibrating sample 
magnetometry was consistent throughout each exfoliation, confirm- 
ing the reusability of the MAO substrate. To the best of our knowledge, 
our demonstration is the first to show wafer reusability for producing 
freestanding complex oxides, which drastically reduces production 
costs for applications (see Extended Data Fig. 7 and Supplementary 
Fig. 3 for details). 

We have further broadened our mechanical exfoliation technique 
to oxides with more complex compositions such as PMN-PT. Single- 
crystalline PMN-PT films were prepared previously by sputtering””®. 
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test.c, The voltage induced across the PMN-PT (6V,,) as a function of the 
alternating-current magnetic field strength at a frequency of 1 kHz. The inset 
shows schematics of the freestanding and the clamped devices. d, Cross- 
sectional TEM image of the CFO/PMN-PT membrane heterostructure. A thin Pt 
(7 nm) film was deposited before transferring the CFO as the top contact to 
PMN-PT. The blue box shows a zoomed-in TEM of the CFO/Pt/PMN-PT 
interface, showing that an oxide bonding layer has formed spontaneously. 

e, Insitu TEM of CFO asa function of applied voltage across the PMN-PT 
membrane. The strain generated by applying a voltage across the PMN-PT is 
transferred to the CFO layer, which induces a large change in strain contrast. 
The movement of the strain contrast can be more clearly seenin 

the Supplementary Video. 


Consequently, remote epitaxy strategies cannot be applied owing to the 
harsh plasma overpressure that rapidly etches graphene*’. We discov- 
ered that PMN-PT is weakly bonded to SRO, allowing mechanical exfolia- 
tion of PMN-PT with near-atomic precision. For this, we grew 500-nm 
PMN-PT/100-nm SRO epitaxial heterostructures on STO substrates by 
sputtering without graphene, followed by the deposition of a3-5-m 
Ni stressor layer with a stress of around 800 MPa. Upon mechanical 
exfoliation, the Ni stressor provided enough strain energy to guide 
the crack propagation precisely at the PMN-PT/SRO interface with 
minimum damage to the substrate (Fig. 2a). As shownin the high-angle 
annular dark-field (HAADF) scanning transmission electron microscope 
(STEM) image in Fig. 2b, the PMN-PT/SRO interface is severely strained 
whereas the SRO/STO interface is pristine. After depositing high-stress 
Ni on PMN-PT, indications of an increased strain at the interface was 
observed. Our geometric phase analysis revealed that a closely spaced 
network of misfit dislocations (spaced about 20 nm apart) applies 
strain at the interface while the Ni stressor provides additional stress 
tothe PMN-PT/SRO interface (Extended Data Fig. 8). We speculate that 
the resulting concentrated strain field at the PMN-PT/SRO interface 
provides a sufficiently weak interface to allow atomically precise crack 
propagation. We have reproducibly verified the precise crack propaga- 
tion through the PMN-PT/SRO interface by atomic force microscopy 
(AFM) onthe exfoliated SRO surface, which showed a root-mean-square 
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Fig. 4 | Electrical, magnetostatic and magnetoelastic coupling between 3D 
and 2D materials. a, Out-of-plane magnetic hysteresis of CFO membrane, YIG 
membrane, and CFO/YIG membrane heterostructures measured at room 
temperature. The YIG and CFO membranes individually show hysteresis loops 
with anin-plane easy axis resulting from shape anisotropy. The YIG has a low in- 
plane coercivity of H, ,p=10 Oe, the CFO is magnetically harder with 

H.,p= 2,950 Oe (Supplementary Fig. 2), and the out-of-plane loops correspond 
to hard-axis loops. b, The linear sum of the magnetic hysteresis of individual 
CFO and YIG membranes compared to that of the measured CFO/YIG 
membrane heterostructure. emu, electromagnetic unit. c, Magnetic hysteresis 
of the CFO/YIG membrane and of just the CFO membrane at 300 °C. The YIG 
becomes paramagnetic and only the CFO loop remains at 300 °C. When the 


roughness of about 2A (Fig. 2c). Cross-sectional TEM investigation of 
the exfoliated PMN-PT membranes revealed high crystalline quality 
as wellas relieved strain at the PMN-PT surface that was bonded to the 
SRO before exfoliation (see Fig. 2d). Single-crystallinity over alarge area 
has been confirmed by EBSD mapping and azimuthal @ scans (Fig. 2e, 
f). In TEM, diffraction patterns taken from all imaged areas showed a 
single (001) orientation consistent with other characterization. We 
note that this technique also worked for (110) PMN-PT films grown on 
SRO or STO (Supplementary Fig. 4). Although details of the mechanism 
for the weakened interface between SRO and PMN-PT still remain to 
be verified, this finding further broadens the range of complex-oxide 
material systems that can be produced as freestanding membranes 
and provides opportunities to develop a graphene-free layer-release 
process by further exploring interface strain engineering. 

Next, we fabricated heterostructures by stacking our freestanding 
membranes where robust mechanical coupling was observed with 


heterostructure membrane is cooled back downto room temperature, full 
recovery of the original hysteresis is observed. d, Schematic illustration of the 
electrical coupling between graphene and magnetic insulators. Graphene is 
undoped (the Fermi level lies at the Dirac point) on thick hBN, n-doped on YIG, 
and p-doped on CFO, shifting the Fermi level back to near the Dirac point when 
sandwiched between YIG and CFO. e, The Raman spectra (focused on the 2D 
peak and the G peak) of each structure shown in d. f-h, The shift in the 2D peak 
(Pos(2D)) (f), the shift in the G peak (Pos(G)) (g) and the FWHM of the G peak 
(FWHM,) (h) for graphene on hBN, YIG and sandwiched between YIG and CFO 
membranes, respectively. The error bars indicate the standard deviation often 
measurements from different parts of the sample. 


high transfer yield (>90%). We first chose to stack CFO membranes 
onto PMN-PT membranes (see Supplementary Fig. 5 for the thin-film 
stacking procedure) to create a composite multiferroic (Fig. 3a). This 
composite allows (1) strain-mediated electric-field control of the mag- 
netism in CFO or (2) magnetic-field-induced voltage generation across 
PMN-PT by virtue of the magnetostrictive and piezoelectric properties 
of CFO and PMN-PT, respectively. 

We expect that the piezoelectric and magnetostrictive properties 
of these two films will be enhanced when both membranes are in their 
freestanding form given that they are thus free from the substrate 
clamping effect. PMN-PT is a material with remarkably high piezoelec- 
tric coefficient in its single-crystalline form* *, whereas CFO hasa high 
magnetostriction coefficient’. Thus, an enhanced strain-mediated 
magnetoelectric response can be expected from the stacked multi- 
ferroic heterostructure if both films are freestanding compared to 
when at least one of the films is clamped to the substrate. Until now, 
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it has only been possible to use bulk materials bonded by glue* or to 
grow polycrystalline films on top of piezoelectric wafers* or mem- 
branes” to realize such a hybrid structure. As expected, we observed 
a substantially enhanced coupling effect when both the CFO and 
PMN-PT were freestanding compared to that of the device where the 
PMN-PT was clamped to the substrate, by measuring the magnetically 
induced magnetoelectric coupling as shown in Fig. 3b. As shown in 
Fig. 3c, the freestanding CFO/PMN-PT device produces substantially 
larger voltage (6V,,,) than the clamped device by more than an order 
of magnitude, with corresponding magnetoelectric coupling coeffi- 
cients of 477 mV cm Oe ‘and 2,675 mV cm™ Oe for the clamped and 
declamped devices, respectively. This magnetoelectric coupling coef- 
ficient is approximately an order of magnitude larger than previously 
reported coefficients on the same material system and comparable 
thickness**”’. This data indicates that strain transfer from CFO to PMN- 
PTis more effective for the freestanding CFO/PMN-PT heterostructure. 
Indeed, such excellent strain transfer between the two freestanding 
membranes was observed using in situ TEM (see Fig. 3d and Extended 
Data Fig. 9 for SEM of fabricated in situ device), showing the change 
in the CFO structure in response to an applied voltage (-10 V to 10 V) 
across the PMN-PT. We note that a bonding oxide layer was spontane- 
ously formed between the CFO membrane and Pt layer for efficient 
strain transfer from the PMN-PT to the CFO (see bottom TEM image 
of Fig. 3d). The Supplementary Video shows the motion of the strain 
fringes in CFO in response to the strain induced by the biased PMN-PT 
underneath (see Fig. 3e for TEM images with or without bias showing 
the generation of strain fringes upon biasing). We were careful to pre- 
vent strain fringes from being generated owing to the sample flexing 
(see Methods for TEM sample preparation). The excellent strain transfer 
from freestanding PMN-PT to CFO was also verified by observing the 
large modulation of the magnetic hysteresis of CFO as a function of bias 
across PMN-PT, in contrast to the clamped device (see Extended Data 
Fig. 10). Thus, the CFO/PMN-PT device showcases an example of a3D 
heterostructure where the functionality of each material is enhanced 
by stacking freestanding 3D membranes of the constituent materials. 

We fabricated additional 3D complex-oxide heterostructures as 
well as 2D to 3D mixed heterostructures forming direct junctions to 
study the feasibility of new physical couplings that are not possible by 
conventional epitaxy. First, we observed clear magnetostatic coupling 
froma CFO/YIG stack. As shownin Fig. 4a, b, the measured hysteresis of 
the CFO/YIG heterostructure repeatedly showed a sharper reversal of 
its out-of-plane magnetization compared to the sum of the individual 
CFO and YIG loops. A signature of magnetoelastic coupling was also 
observed by heating the CFO/YIG heterostructure to 300 °C, just above 
the Curie temperature of YIG (277 °C). As shown in Fig. 4c, the loop of 
YIG/CFO differs from that of a single CFO layer at 300 °C although the 
YIG no longer contributes a magnetic moment above its Curie tem- 
perature. We speculate that the higher thermal expansion coefficient 
of YIGimposes an in-plane strain on CFO, resulting ina magnetoelastic 
anisotropy favouring out-of-plane magnetization. Cooling to room 
temperature restored the two-step loop seen in Fig. 4a. These findings 
lay the foundations to discover physical coupling phenomena through 
simple stacking of these and many other functional oxides, choosing 
hetero-systems of interest from the huge library of freestanding mem- 
brane material sets enabled by various techniques. 

This 3D heterostructuring technique not only offers great flexibility 
to design coupled multifunctional oxide films with enhanced perfor- 
mance, but also provides a platform to integrate various 3D and 2D 
material heterostructures with tailored functionalities to study novel 
interface phenomena. For example, we were able to tune the Fermi level 
of graphene with respect to its Dirac point by sandwiching it between 
YIG and CFO membranes. This was measured by tracking the 2D and 
G peaks of the Raman spectra of graphene, wherein contact with the 
YIG n-dopes the graphene and contact with CFO p-dopes the graphene, 
whereas graphene stays intrinsic when on thick hexagonal boron nitride 
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(hBN) (30 nm) (Fig. 4d)*°. It is well known that the shift in the 2D and 
G peaks as well as the full-width at half-maximum (FWHM) change in 
the G peak correlates with the Fermi level“. As expected, a clear indi- 
cation of n-doping was observed on graphene transferred onto a YIG 
membrane relative to the undoped graphene on hBN (blue-shift and 
red-shift of the 2D and G peaks, respectively, with a narrowing of the 
FWHM of the G peak). When graphene is sandwiched by transferring 
aCFO membrane ontop of the graphene/YIG stack, graphene reverts 
back to anearly undoped state similar to graphene on hBN (Fig. 4e-h). 
Such electrically coupled 2D and 3D mixed heterostructures providea 
platform on which to study interfacial and proximity-induced physical 
couplings in 2D and 3D heterostructures, which so far has only been 
possible theoretically*°. Combined with other conventional lift-off 
methods, it is now possible to couple and integrate an unprecedent- 
edly broad range of functional single-crystalline membranes (III-V or 
III-N, complex oxides and 2D layered materials) on a single platform. 
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Methods 


Epitaxial graphene growth 

Monolayer epitaxial graphene was grown via silicon sublimation from 
the silicon face of 6H silicon carbide (SiC (0001)) in a three-phase, 
hot-zone, graphite furnace (Thermal Technology LLC). In this case, 
a 4-inch wafer was used, and a graphite crucible was constructed to 
accommodate the 4-inch wafer in the furnace. The SiC was first cleaned 
using organic solvents (acetone, isopropyl alcohol, Nanostrip). Sub- 
sequently, the SiC is annealed in 10% hydrogen (with the rest argon) 
at 1,500 °C for 30 min to remove subsurface damage due to chemical 
and mechanical polishing. The H, was then purged from the system, 
and the temperature was increased to 1,800 °C for 10 min at 700 torr 
to form the graphene layers. This process yields low-defect-density 
monolayer epitaxial graphene. 


Epitaxial complex-oxide growth 

Surface preparation. Prior to graphene transfer and growth, the STO 
substrate surface was dipped in buffered hydrofluoric acid for 20s and 
annealed ina furnace at 1,100 °C for 6 h. AFM was measured to ensure 
step-and-terrace surface morphology. MAO and GGG substrates were 
rinsed in acetone and isopropyl alcohol for 5 min each in an ultrasonic 
bath with no special surface treatment. 


Pulsed laser deposition. STO, CFO and YIG films were grown using 
a pulsed-laser deposition with a KrF laser energy of 400 mJ and pulse 
rate of 10 Hz. Commercial ceramic or bulk single-crystal targets were 
used. STO was grown ontop of graphene-coated (100) STO substrates 
at atemperature of 850 °C and an oxygen flow of 20 mtorr. The initial 
500 shots to the target were made without oxygen flow to protect the 
graphene layer on the oxide substrate for all materials. The CFO film 
was grown at a temperature of 400 °C and an oxygen pressure of 10 
mtorr on top of a graphene-coated (100) MAO substrate. Finally, the 
YIG film was grown at atemperature of 700 °C and oxygen pressure of 
20 mtorr ontop of a graphene-coated (111) GGG substrate. After growth, 
the YIG film was then post-annealed at 850 °C for 2h under an oxygen 
overpressure to improve crystal quality. 


Sputtering deposition. 90° off-axis sputtering* and misaligned par- 
allel dual planar magnetron sputtering”*”’ were employed to deposit 
epitaxial SRO and PMN-PT films, respectively. The SRO layer (100 nm) 
was deposited at a temperature of 600 °C and total pressure of 200 
mtorr while maintaining a 3:2 ratio of Ar and O, gases. The PMN-PT 
layer (SOO nm) was grown at atemperature of 625 °C under a total back- 
ground pressure of 500 mtorr, maintaining a 17:3 ratio of Ar and O,. 


MBE deposition. BTO films were grown by MBEina Veeco GEN10 MBE 
system. Molecular beams of barium and titanium were generated using 
aconventional effusion cell and a Ti-Ball titanium sublimation pump, 
respectively. The fluxes were calibrated using RHEED intensity oscil- 
lations. Barium and titanium were co-deposited onto the substrate in 
an oxygen background partial pressure of 7 x 10” torr. The substrate 
temperature was held at 850 °C. In situ RHEED images were consist- 
ent with the growth of smooth and epitaxial thin-film surfaces during 
deposition. 


Graphene transfer 

First, the graphene was exfoliated from its host SiC substrate by deposit- 
ing Ni(~500 nm) as an adhesive/support layer. This was accomplished 
by first depositing a thin Nilayer using electron-beam evaporation (20 
nm) to protect the graphene, followed by Ni sputtering at a chamber 
pressure of 1 x 10° torr and Ar flow of 9.5 standard cubic centimetres 
per minute (sccm). A thermal release tape (Revalpha 319Y-4M) was 
then used to detach the Nilayer along with the graphene. The thermal 
release tape/Ni/graphene stack was directly transferred onto the oxide 


substrate, and the thermal release tape was released at atemperature 
of 120 °C. The Ni was then etched in FeCl, solution, leaving only gra- 
phene on the oxide substrate. Finally, the sample was gently rinsed in 
acetone and isopropyl alcohol. This process was repeated to transfer 
two to three layers of graphene. 


Ni stressor deposition 

The Ni stressor layer was deposited using plasma sputtering, using a 
commercially bought Ni target with 99.99% purity. A thin Ti adhesive 
layer (20-80 nm) was deposited using electron-beam evaporation 
before depositing the Ni stressor. The Ni was sputtered at a chamber 
pressure of 2 x 10° torr with 9.5 sccm of Ar flow, with a growth rate of 
approximately 2 umh7. 


Characterization 

SEM, EBSD, AFM and Raman measurements. SEM and EBSD measure- 
ments were made using a ZEISS Merlin high-resolution SEM equipped 
with an EBSD detector. AFM measurements were carried out using a 
Park NX10 AFM tool in non-contact mode. Raman spectra were obtained 
using a Renishaw Invia Reflex Raman confocal microscope witha laser 
wavelength of 532 nm, power of 1mW and alaser spot size of 2 um. 


TEM measurements. Cross-sectional TEM specimens were prepared 
using the focused ion beam (FEI Helios 660) technique. To prevent ion- 
beam damage and contamination caused by metal ions, the sample 
was passivated using electron-beam assisted amorphous carbon (100 
nm) before using the focused ion beam. During the ion-milling process, 
the ion-beam energy was artificially controlled from 30 kV to2 kV to 
achieve ultrathin TEM samples. Ex situ (S)TEM experiments were per- 
formed using JEOL 2010F and JEOL ARM 200CF (probe Cs-corrected) 
microscopes operated at 200 kV. Atomic-resolution STEM observa- 
tions of epitaxial films were conducted using aJEOL ARM 200CF with 
a probe convergence angle of 20 mrad. A HAADF detector angle of 
90-175 mrad and an annular bright-field detector angle of 11-23 mrad 
were used. For in situ TEM experiments, a miniature CFO/PMN-PT 
magnetoelectric coupled device was fabricated using the focused 
ion beam technique. An electron-beam assisted Pt electrode for metal 
probe contact was deposited onto the PMN-PT films, and the sample 
surface, including CFO and PMN-PT, was passivated by electron-beam 
induced amorphous carbon. A focused-ion-beam-cleaved specimen 
was connected with a metal half-grid to make the electric circuit, and 
this miniature device was isolated by aside-cutting method usingion 
milling with a low acceleration voltage of 5 kV. To remove the amor- 
phous-carbon-assisted effect, the remaining amorphous carbon on 
the top of CFO was eliminated using a low-energy ion beam during 
the final milling stage. In situ TEM experiments were carried out us- 
ing aJEOL 2010F analytical electron microscope with an acceleration 
voltage of 200 kV in TEM mode equipped with a biasing holder (Na- 
nofactory Instruments AB) functionalized by a scanning tunnelling 
microscopy system. For electrical switching, a direct-current bias 
was applied inside a TEM between a sharp Pt-Ir tip operated by the 
scanning tunnelling microscopy function, contacting directly with the 
7-nm-thick Pt layer. The TEM probe tip placement was made far from 
the observed CFO region (about 5 pm), witha relatively thick platinum 
contact region to minimize any effects from bending of the sample. 
Only negligible displacement of the sample was observed during in situ 
measurements, which also preclude any bending effects. Real-time 
high-resolution TEM videos were captured using a 2,048 pixel x 1,080 
pixel resolution charge-coupled device (CCD) camera. 


Magnetoelectric device fabrication 

We first transferred freestanding single-crystalline PMN-PT ontoa Ti- 
coated polydimethylsiloxane (PDMS) substrate where Ti was used as 
the bottom electrode, followed by fabricating a 7-nm Pt top contact 
on the PMN-PT. Then, the CFO membrane was directly transferred 


onto the Pt-coated PMN-PT to complete the heterostructured device. 
The device was annealed at 150 °C overnight to remove any moisture. 


Magnetoelectric coupling measurement 

We applied a small alternating-current magnetic field at a frequency of 
1kHz ontop of a direct-current magnetic field (5 kOe) in-plane across 
the CFO/PMN-PT device, then measured the induced voltage across the 
PMN-PT membrane. Voltage is generated across the PMN-PT membrane 
when the magnetoelastic strain in the CFO induced by the magnetic 
field is transferred to the PMN-PT“. 


Data availability 


The data that support the findings of this study are available from the 
corresponding author upon reasonable request. 
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Extended Data Fig. 1| Density functional theory simulation of substrate 
surface potential penetrating through graphene layers ona STO substrate. 
a, Illustration of the simulated structure. b, The potential fluctuation through 


graphene asa function of monolayer (IML) and bilayer (2ML) graphene 


thickness d. The inset shows the potential fluctuation map on the surface of 
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graphene-coated STO substrates for monolayer graphene (left) and bilayer 
graphene (right). c, The potential fluctuation map (colour scale) with three 
monolayers of graphene on top of the STO surface. d, Cross-sectional potential 
profile along the red line showninc. 
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Extended Data Fig. 2| Pulsed-laser deposition of STO on graphene-coated 


The yellow arrows indicate RHEED patterns caused by the transferred 
STO substrate. a, RHEED pattern during growth of STO on graphene-coated 


graphene. b, RHEED oscillation during the growth of STO onthe graphene- 
STO substrate, showing crystalline growth through the entire growth process. coated STO substrate. 
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Extended Data Fig. 3 | Cross-sectional TEM analysis of exfoliated STO 
membrane. a, Low-magnification TEM image of STO membrane supported by 
the Nistressor layer. b, Zoom-in of polycrystalline domains caused by residues 
left on graphene after transfer. c, High-resolution TEM image of the STO 
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membrane (centre), with selected area electron diffraction (left) and high- 
resolution TEM (right) images, confirming the overall single-crystallinity ofthe 
membrane. The single-crystalline STO membrane has acubic structure inthe 
Pm3mspace group with lattice distances dig. and doo, of 0.4nm. 
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Extended Data Fig. 4| STEM analysis of the SrTiO, buffer layer grownin 
vacuum.a, and b, show the bright-field (BF) and HAADF-STEM images of the 
exfoliated STO membrane (the same TEM sample as shown in Extended Data 
Fig. 3) at low magnification, respectively. c and d show higher-resolution 
bright-field and HAADF-STEM images of the sample, respectively. e, Electron 
energy loss spectroscopy spectra and line profile from the exfoliated surface 
tothe bulk region (A-B ind), verifying that the composition of the buffer layer 
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grown invacuumis identical to the region grown under oxygen overpressure. 
f, High-resolution HAADF, showing individual atoms of the STO membrane at 
the exfoliation surface (the region grownin vacuum). The annular bright-field 
(ABF) and contrast inverted annular bright-field images (g and h, respectively) 
clearly show the absence of oxygen vacancies and no discernible differences 
are observed between the regions grown in vacuum and in oxygen. 
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Extended Data Fig. 5 | Cross-sectional TEM analysis of exfoliated CFO CFO film (centre), with selected area electron diffraction (left) and high- 
membrane. a, Cross-sectional TEM of exfoliated CFO onthe Ti/Ni stressor resolution TEM (right) images, confirming the overall single-crystallinity ofthe 
layer. The red dotted line indicates a polycrystalline domain causedbyresidues | membrane. Thesingle-crystalline CFO membrane has a cubic structure inthe 
left during graphene transfer. b, Zoomed-in TEM of the polycrystalline area Fd3mspace group witha lattice distance d), and doo, of 0.41nm. 


marked by red dotted lines. c, Higher-resolution cross-sectional TEM of the 
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50 nm BTO membrane ~ 


Extended Data Fig. 6| Exfoliation and characterization ofBaTiO,membrane crystalline (100) orientation. c, The inverse-pole map of the EBSD data shown 
grown via MBE. a, Photograph of exfoliated BTO membrane (50 nm) grown via inb.d, Electron backscattering patterns (also knownas Kikuchi patterns) of the 
remote epitaxy. b, EBSD of the exfoliated BTO membrane showing single- BTO membraneseen over the entire area of the sample. 
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Extended Data Fig. 7| Reusability ofa graphene-coated MAO substrate. 

a,b, Microscope images of aMAO substrate after exfoliating a CFO film grown 
on monolayer (a) and bilayer (b) graphene, where severe damage onthe surface 
ofthe MAO substrate after exfoliation of CFO grown on monolayer graphene 
was observed, caused by crack propagation into the substrate. No evidence of 
damage was observed on substrates coated with bilayer graphene because the 
second graphene transfer covers the macroscopic defective areas of the first 
graphene layer. The scale bar indicates 1mm.c, AFM of the pristine MAO 
substrate surface (left) and after one cycle of CFO exfoliation (right) with a root- 
mean-square roughness of approximately 5.5 A before and after exfoliation. 
Scale bar indicates 11m. d, e, Raman spectra showing the characteristic D peak, 


G peak and 2D peak of graphene (d) and Raman intensity mapping (e) of the 2D 
peak (2,685 cm) of graphene onthe MAO substrate after one cycle of CFO 
exfoliation, showing evidence that graphene is preserved onthe MAO substrate 
after exfoliation, probably because the non-specific adhesion between 
graphene and CFO is weaker than that between graphene and MAO. Where 

this is not the case, we could etch off any graphene remaining onthe substrate 
and re-deposit graphene before epitaxy. The scale bar indicates 10 pm. 

f, Magnetic hysteresis M of the three exfoliated CFO membranes produced ona 
single graphene-coated MAO substrate measured by vibrating sample 
magnetometry at room temperature. a.u., arbitrary units. 
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Extended Data Fig. 8 | STEM imaging and strain analysis of the PMN-PT/SRO/ d, Geometric phase analysis of the PMN-PT/SRO and SRO/STO interfaces in the 
STO interfaces. a, b, Cross-sectional HAADF-STEM images of the PMN-PT/SRO/ =x direction (2nd column), y direction (3rd column) and rotational geometry 


STO interfaces with and without a Ni stressor layer. Clear straining at the PMN- (last column) with and without the Ni stressor layer. The white arrows indicate 
PT/SRO interface can be seen witha Ni stressor layer, whereas the SRO/STO edge dislocations. The colour scale indicates the strain fraction with reference 
interface remains unstrained. c, Atomic-resolution STEM image of one of the tothe SRO substrate. 


periodic edge dislocations observed at the PMN-PT/SRO interface. 
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Extended Data Fig. 9 | Description of the in situ TEM CFO/PMN-PT much thicker and further away from the actively observed region (distance 
heterostructure device. a, A cross-sectional SEM of the in situ CFO/PMN-PT greater than5 pm) to prevent effects caused by bending of the sample. b, High- 
magnetoelectric device. A thick Pt layer (labelled TEM probe contact) was resolution TEM image of the CFO/Pt/PMN-PT interface, showing a thin 
deposited on top of the 7-nm-thick Pt layer to enable the TEM probe tip to amorphous oxide layer that has formed between the CFO and Pt, enabling 


establish electrical contact. The TEM probe contact was intentionally made efficient strain coupling. 
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structure, the PMN-PT film is grown onaSRO/STO substrate, and the CFO 
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Cancer is driven by genetic change, and the advent of massively parallel sequencing has 
enabled systematic documentation of this variation at the whole-genome scale’ °. Here 
we report the integrative analysis of 2,658 whole-cancer genomes and their matching 
normal tissues across 38 tumour types from the Pan-Cancer Analysis of Whole Genomes 
(PCAWG) Consortium of the International Cancer Genome Consortium (ICGC) and The 
Cancer Genome Atlas (TCGA). We describe the generation of the PCAWG resource, 
facilitated by international data sharing using compute clouds. On average, cancer 
genomes contained 4-5 driver mutations when combining coding and non-coding 
genomic elements; however, in around 5% of cases no drivers were identified, 
suggesting that cancer driver discovery is not yet complete. Chromothripsis, in which 
many clustered structural variants arise in a single catastrophic event, is frequently an 
early event in tumour evolution; in acral melanoma, for example, these events precede 
most somatic point mutations and affect several cancer-associated genes 
simultaneously. Cancers with abnormal telomere maintenance often originate from 
tissues with low replicative activity and show several mechanisms of preventing 
telomere attrition to critical levels. Common and rare germline variants affect patterns 
of somatic mutation, including point mutations, structural variants and somatic 
retrotransposition. A collection of papers from the PCAWG Consortium describes 
non-coding mutations that drive cancer beyond those in the TERT promoter’*; identifies 
new signatures of mutational processes that cause base substitutions, small insertions 
and deletions and structural variation>*; analyses timings and patterns of tumour 
evolution’; describes the diverse transcriptional consequences of somatic mutation on 
splicing, expression levels, fusion genes and promoter activity®”; and evaluates a range 


of more-specialized features of cancer genomes 


8,10-18 


Cancer is the second most-frequent cause of death worldwide, 
killing more than 8 million people every year; the incidence of cancer 
is expected to increase by more than 50% over the coming decades””””. 
‘Cancer’ is acatch-all term used to denote a set of diseases characterized 
by autonomous expansion and spread of a somatic clone. To achieve 
this behaviour, the cancer clone must co-opt multiple cellular pathways 
that enable it to disregard the normal constraints on cell growth, modify 
the local microenvironment to favour its own proliferation, invade 
through tissue barriers, spread to other organs and evade immune sur- 
veillance”. No single cellular program directs these behaviours. Rather, 
thereisa large pool of potential pathogenic abnormalities from which 
individual cancers draw their own combinations: the commonalities 
of macroscopic features across tumours belie a vastly heterogeneous 
landscape of cellular abnormalities. 

This heterogeneity arises from the stochastic nature of Darwinian 
evolution. There are three preconditions for Darwinian evolution: 
characteristics must vary within a population; this variation must be 
heritable from parent to offspring; and there must be competition for 
survival within the population. Inthe context of somatic cells, heritable 
variation arises from mutations acquired stochastically throughout 
life, notwithstanding additional contributions from germline and 
epigenetic variation. A subset of these mutations alter the cellular 
phenotype, and a small subset of those variants confer an advantage 


on clones during the competition to escape the tight physiological 
controls wired into somatic cells. Mutations that provide a selective 
advantage to the clone are termed driver mutations, as opposed to 
selectively neutral passenger mutations. 

Initial studies using massively parallel sequencing demonstrated the 
feasibility of identifying every somatic point mutation, copy-number 
change and structural variant (SV) ina given cancer’ °. In 2008, recog- 
nizing the opportunity that this advance in technology provided, the 
global cancer genomics community established the ICGC with the 
goal of systematically documenting the somatic mutations that drive 
common tumour types”. 


The pan-cancer analysis of whole genomes 


The expansion of whole-genome sequencing studies from individual 
ICGC and TCGA working groups presented the opportunity to under- 
take a meta-analysis of genomic features across tumour types. To 
achieve this, the PCAWG Consortium was established. A Technical 
Working Group implemented the informatics analyses by aggregating 
the raw sequencing data from different working groups that studied 
individual tumour types, aligning the sequences to the human genome 
and delivering a set of high-quality somatic mutation calls for down- 
stream analysis (Extended Data Fig. 1). Given the recent meta-analysis 


A list of members and their affiliations appears in the online version of the paper and lists of working groups appear in the Supplementary Information. 
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Box 1 


Online resources for data access, visualization and analysis 


The PCAWG landing page (http://docs.icgc.org/pcawg) provides 
links to several data resources for interactive online browsing, 
analysis and download of PCAWG data and results (Supplementary 
Table 4). 

Direct download of PCAWG data 

Aligned PCAWG read data in BAM format are also available at 

the European Genome Phenome Archive (EGA; https://www. 
ebi.ac.uk/ega/search/site/pcawg under accession number 
EGASO0001001692). In addition, all open-tier PCAWG genomics 
data, as well as reference datasets used for analysis, can be 
downloaded from the ICGC Data Portal at http://docs.icgc.org/ 
pcawg/data/. Controlled-tier genomic data, including SNVs and 
indels that originated from TCGA projects (in VCF format) and 
aligned reads (in BAM format) can be downloaded using the 

Score (https://www.overture.bio/) software package, which has 
accelerated and secure file transfer, as well as BAM slicing facilities 
to selectively download defined regions of genomic alignments. 
PCAWG computational pipelines 

The core alignment, somatic variant-calling, quality-control and 
variant consensus-generation pipelines used by PCAWG have each 
been packaged into portable cross-platform images using the 
Dockstore system™ and released under an Open Source licence that 
enables unrestricted use and redistribution. All PCAWG Dockstore 
images are available to the public at https://dockstore.org/ 
organizations/PCAWG/collections/PCAWG. 

ICGC Data Portal 

The ICGC Data Portal® (https://dcec.icgc.org) serves as the main 
entry point for accessing PCAWG datasets with a single uniform web 
interface and a high-performance data-download client. This uniform 
interface provides users with easy access to the myriad of PCAWG 
sequencing data and variant calls that reside in many repositories 
and compute clouds worldwide. Streaming technology®™ provides 
users with high-level visualizations in real time of BAM and VCF files 
stored remotely on the Cancer Genome Collaboratory. 


of exome data fromthe TCGA Pan-Cancer Atlas”, scientific working 
groups concentrated their efforts on analyses best-informed by whole- 
genome sequencing data. 

We collected genome data from 2,834 donors (Extended Data 
Table 1), of which 176 were excluded after quality assurance. A further 
75 had minor issues that could affect some of the analyses (grey-listed 
donors) and 2,583 had data of optimal quality (white-listed donors) 
(Supplementary Table 1). Across the 2,658 white- and grey-listed donors, 
whole-genome sequencing data were available from 2,605 primary 
tumours and 173 metastases or local recurrences. Mean read coverage 
was 39x for normal samples, whereas tumours had a bimodal cover- 
age distribution with modes at 38x and 60x (Supplementary Fig. 1). 
RNA-sequencing data were available for 1,222 donors. The final cohort 
comprised 1,469 men (55%) and 1,189 women (45%), with a mean age of 
56 years (range, 1-90 years) across 38 tumour types (Extended Data 
Table 1and Supplementary Table 1). 

To identify somatic mutations, we analysed all 6,835 samples using 
auniform set of algorithms for alignment, variant calling and quality 
control (Extended Data Fig. 1, Supplementary Fig. 2and Supplementary 
Methods 2). We used three established pipelines to call somatic single- 
nucleotide variations (SNVs), small insertions and deletions (indels), 
copy-number alterations (CNAs) and SVs. Somatic retrotransposition 
events, mitochondrial DNA mutations and telomere lengths were also 
called by bespoke algorithms. RNA-sequencing data were uniformly 


UCSC Xena 

UCSC Xena’®”’ (https://pcawg.xenahubs.net) visualizes all PCAWG 
primary results, including copy-number, gene-expression, gene-fusion 
and promoter-usage alterations, simple somatic mutations, large 
somatic structural variations, mutational signatures and phenotypic 
data. These open-access data are available through a public Xena 
hub, and consensus simple somatic mutations can be loaded to the 
local computer of a user via a private Xena hub. Kaplan-Meier plots, 
histograms, box plots, scatter plots and transcript-specific views offer 
additional visualization options and statistical analyses. 

The Expression Atlas 

The Expression Atlas (https://www.ebi.ac.uk/gxa/home) contains 
RNA-sequencing and expression microarray data for querying 

gene expression across tissues, cell types, developmental stages 
and/or experimental conditions®. Two different views of the data 
are provided: summarized expression levels for each tumour type 
and gene expression at the level of individual samples, including 
reference-gene expression datasets for matching normal tissues. 
PCAWG Scout 

PCAWG Scout (http://pcawgscout.bsc.es/) provides a framework for 
-omics workflow and website templating to generate on-demand, 
in-depth analyses of the PCAWG data that are openly available to the 
whole research community. Views of protected data are available 
that still safeguard sensitive data. Through the PCAWG Scout web 
interface, users can access an array of reports and visualizations 
that leverage on-demand bioinformatic computing infrastructure 
to produce results in real time, allowing users to discover trends as 
well as form and test hypotheses. 

Chromothripsis Explorer 

Chromothripsis Explorer (http://compbio.med.harvard.edu/ 
chromothripsis/) is a portal that allows structural variation in the 
PCAWG dataset to be explored on an individual patient basis 
through the use of circos plots. Patterns of chromothripsis can also 
be explored in aggregated formats. 


processed to call transcriptomic alterations. Germline variants identi- 
fied by the three separate pipelines included single-nucleotide poly- 
morphisms, indels, SVs and mobile-element insertions (Supplementary 
Table 2). 

The requirement to uniformly realign and call variants on approxi- 
mately 5,800 whole genomes presented considerable computational 
challenges, and raised ethical issues owing to the use of data from dif- 
ferent jurisdictions (Extended Data Table 2). We used cloud comput- 
ing”*”’ to distribute alignment and variant calling across 13 data centres 
on 3 continents (Supplementary Table 3). Core pipelines were pack- 
aged into Docker containers” as reproducible, stand-alone packages, 
which we have made available for download. Data repositories for raw 
and derived datasets, together with portals for data visualization and 
exploration, have also been created (Box 1and Supplementary Table 4). 


Benchmarking of genetic variant calls 


To benchmark mutation calling, we ran the 3 core pipelines, together 
with 10 additional pipelines, on 63 representative tumour-normal 
genome pairs (Supplementary Note 1). For 50 of these cases, we per- 
formed validation by hybridization of tumour and matched normal DNA 
toacustom bait set with deep sequencing”. The 3 core somatic variant- 
calling pipelines had individual estimates of sensitivity of 80-90% 
to detect a true somatic SNV called by any of the 13 pipelines; more 
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Fig. 1| Validation of variant-calling pipelines in PCAWG. a, Scatter plot of 
estimated sensitivity and precision for somatic SNVs across individual 
algorithms assessed in the validation exercise across n = 63 PCAWG samples. 
Core algorithms included in the final PCAWG call set are shown in blue. 

b, Sensitivity and precision estimates across individual algorithms for 
somatic indels. c, Accuracy (precision, sensitivity and F, score, defined as 

2 x sensitivity x precision/(sensitivity + precision)) of somatic SNV calls across 
variant allele fractions (VAFs) for the core algorithms. The accuracy of two 
methods of combining variant calls (two-plus, which was used in the final 
dataset, and logistic regression) is also shown. d, Accuracy of indel calls 
across variant allele fractions. 


than 95% of SNV calls made by each of the core pipelines were genu- 
ine somatic variants (Fig. 1a). For indels—a more-challenging class of 
variants to identify with short-read sequencing—the 3 core algorithms 
had individual sensitivity estimates in the range of 40-50%, with pre- 
cision of 70-95% (Fig. 1b). For individual SV algorithms, we estimated 
precision to be inthe range 80-95% for samples in the 63-sample pilot 
dataset. 

Next, we defined a strategy to merge results from the three pipelines 
into one final call-set to be used for downstream scientific analyses 
(Methods and Supplementary Note 2). Sensitivity and precision of 
consensus somatic variant calls were 95% (90% confidence interval, 
88-98%) and 95% (90% confidence interval, 71-99%), respectively, for 
SNVs (Extended Data Fig. 2). For somatic indels, sensitivity and preci- 
sion were 60% (34-72%) and 91% (73-96%), respectively (Extended Data 
Fig. 2). Regarding somatic SVs, we estimate the sensitivity of merged 
calls to be 90% for true calls generated by any one pipeline; precision 
was estimated as 97.5%. The improvement in calling accuracy from 
combining different pipelines was most noticeable in variants with 
low variant allele fractions, which probably originate from tumour 
subclones (Fig. 1c, d). Germline variant calls, phased using a haplotype- 
reference panel, displayed a precision of more than 99% and asensitivity 
of 92-98% (Supplementary Note 2). 


Analysis of PCAWG data 


The uniformly generated, high-quality set of variant calls across more 
than 2,500 donors provided the springboard for a series of scientific 
working groups to explore the biology of cancer. A comprehensive 
suite of companion papers that describe the analyses and discoveries 
across these thematic areas is copublished with this paper* * (Extended 
Data Table 3). 
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Pan-cancer burden of somatic mutations 


Across the 2,583 white-listed PCAWG donors, we called 43,778,859 
somatic SNVs, 410,123 somatic multinucleotide variants, 2,418,247 
somatic indels, 288,416 somatic SVs, 19,166 somatic retrotransposition 
events and 8,185 de novo mitochondrial DNA mutations (Supplemen- 
tary Table 1). There was considerable heterogeneity in the burden of 
somatic mutations across patients and tumour types, with a broad 
correlation in mutation burden among different classes of somatic 
variation (Extended Data Fig. 3). Analysed at a per-patient level, this 
correlation held, even when considering tumours with similar purity 
and ploidy (Supplementary Fig. 3). Why such correlation should apply 
onapan-cancer basis is unclear. It is likely that age has some role, as we 
observe a correlation between most classes of somatic mutation and 
age at diagnosis (around 190 SNVs per year, P= 0.02; about 22 indels 
per year, P=5 x 10°; 1.5 SVs per year, P< 2 x10; linear regression 
with likelihood ratio tests; Supplementary Fig. 4). Other factors are 
also likely to contribute to the correlations among classes of somatic 
mutation, as there is evidence that some DNA-repair defects can cause 
multiple types of somatic mutation”, and a single carcinogen can cause 
arange of DNA lesions”. 


Panorama of driver mutations in cancer 


We extracted the subset of somatic mutations in PCAWG tumours 
that have high confidence to be driver events on the basis of current 
knowledge. One challenge to pinpointing the specific driver muta- 
tions in an individual tumour is that not all point mutations in recur- 
rently mutated cancer-associated genes are drivers*™. For genomic 
elements significantly mutated in PCAWG data, we developed a ‘rank- 
and-cut’ approach to identify the probable drivers (Supplementary 
Methods 8.1). This approach works by ranking the observed mutations 
inagiven genomic element based on recurrence, estimated functional 
consequence and expected pattern of drivers in that element. We then 
estimate the excess burden of somatic mutations in that genomic 
element above that expected for the background mutation rate, and cut 
the ranked mutations at this level. Mutations ineach element with the 
highest driver ranking were then assigned as probable drivers; those 
below the threshold will probably have arisen through chance and were 
assigned as probable passengers. Improvements to features that are 
used to rank the mutations and the methods used to measure them 
will contribute to further development of the rank-and-cut approach. 

We also needed to account for the fact that some bona fide cancer 
genomic elements were not rediscovered in PCAWG data because 
of low statistical power. We therefore added previously known 
cancer-associated genes to the discovery set, creating a ‘compendium 
of mutational driver elements’ (Supplementary Methods 8.2). Then, 
using stringent rules to nominate driver point mutations that affect 
these genomic elements on the basis of prior knowledge™, we separated 
probable driver from passenger point mutations. To cover all classes 
of variant, we also created a compendium of known driver SVs, using 
analogous rules to identify which somatic CNAs and SVs are most likely 
to act as drivers in each tumour. For probable pathogenic germline 
variants, we identified all truncating germline point mutations and 
SVs that affect high-penetrance germline cancer-associated genes. 

This analysis defined a set of mutations that we could confidently 
assert, based on current knowledge, drove tumorigenesis inthe more 
than 2,500 tumours of PCAWG. We found that 91% of tumours had at 
least one identified driver mutation, with an average of 4.6 drivers per 
tumour identified, showing extensive variation across cancer types 
(Fig. 2a). For coding point mutations, the average was 2.6 drivers per 
tumour, similar to numbers estimated in known cancer-associated 
genes in tumours in the TCGA using analogous approaches”. 

To address the frequency of non-coding driver point mutations, 
we combined promoters and enhancers that are known targets of 
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Fig. 2| Panorama of driver mutations in PCAWG.a, Top, putative driver 
mutations in PCAWG, represented asa circos plot. Each sector representsa 
tumour inthe cohort. From the periphery to the centre of the plot the 
concentric rings represent: (1) the total number of driver alterations; (2) the 
presence of whole-genome (WG) duplication; (3) the tumour type; (4) the 
number of driver CNAs; (5) the number of driver genomic rearrangements; 

(6) driver coding point mutations; (7) driver non-coding point mutations; and 
(8) pathogenic germline variants. Bottom, snapshots of the panorama of driver 
mutations. The horizontal bar plot (left) represents the proportion of patients 
with different types of drivers. The dot plot (right) represents the mean 
number of each type of driver mutation across tumours with at least one event 
(the square dot) and the standard deviation (grey whiskers), based onn=2,583 


non-coding drivers** ” with those newly discovered in PCAWG data; 
this is reported in a companion paper‘. Using this approach, only 
13% (785 out of 5,913) of driver point mutations were non-coding 
in PCAWG. Nonetheless, 25% of PCAWG tumours bear at least one 
putative non-coding driver point mutation, and one third (237 out 
of 785) affected the TERT promoter (9% of PCAWG tumours). Overall, 
non-coding driver point mutations are less frequent than coding 
driver mutations. With the exception of the TERT promoter, indi- 
vidual enhancers and promoters are only infrequent targets of driver 
mutations’. 

Across tumour types, SVs and point mutations have different rela- 
tive contributions to tumorigenesis. Driver SVs are more prevalent 
in breast adenocarcinomas (6.4 + 3.7 SVs (mean + s.d.) compared 
with 2.2 + 1.3 point mutations; P< 1x 10°, Mann-Whitney U-test) 
and ovary adenocarcinomas (5.8 + 2.6 SVs compared with 1.9 +1.0 
point mutations; P<1x 107°), whereas driver point mutations have 


Proportion of patients with the gene altered as biallelic 


patients. b, Genomic elements targeted by different types of mutations inthe 
cohort altered in more than 65 tumours. Both germline and somatic variants 
are included. Left, the heat map shows the recurrence of alterations across 
cancer types. The colour indicates the proportion of mutated tumours and the 
number indicates the absolute count of mutated tumours. Right, the 
proportion of each type of alteration that affects each genomic element. 

c, Tumour-suppressor genes with biallelic inactivation in 10 or more patients. 
The values included under the gene labels represent the proportions of 
patients who have biallelic mutations in the gene out of all patients witha 
somatic mutation in that gene. GR, genomic rearrangement; SCNA, somatic 
copy-number alteration; SGR, somatic genome rearrangement; TSG, tumour 
suppressor gene; UTR, untranslated region. 


a larger contribution in colorectal adenocarcinomas (2.4 + 1.4 SVs 
compared with 7.4 + 7.0 point mutations; P= 4 x 107°) and mature 
B cell lymphomas (2.2 + 1.3 SVs compared with 6 + 3.8 point muta- 
tions; P<1x10°"*), as previously shown*®. Across tumour types, there 
are differences in which classes of mutation affect a given genomic 
element (Fig. 2b). 

We confirmed that many driver mutations that affect tumour- 
suppressor genes are two-hit inactivation events (Fig. 2c). For exam- 
ple, of the 954 tumours in the cohort with driver mutations in TP53, 
736 (77%) had both alleles mutated, 96% of which (707 out of 736) 
combined a somatic point mutation that affected one allele with 
somatic deletion of the other allele. Overall, 17% of patients had 
rare germline protein-truncating variants (PTVs) in cancer-predis- 
position genes”, DNA-damage response genes“ and somatic driver 
genes. Biallelic inactivation due to somatic alteration on top ofa 
germline PTV was observed in 4.5% of patients overall, with 81% of 
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Fig. 3 | Analysis of patients with no detected driver mutations. a, Individual 
estimates of the percentage of tumour-in-normal contamination across 
patients with no driver mutations in PCAWG (n=181). No data were available for 
myelodysplastic syndromes and acute myeloid leukaemia. Points represent 
estimates for individual patients, and the coloured areas are estimated density 
distributions (violin plots). Abbreviations of the tumour types are defined in 
Extended Data Table 1.b, Average detection sensitivity by tumour type for 
tumours without known drivers (n=181). Each dot represents a given sample 
and is the average sensitivity of detecting clonal substitutions across the 
genome, taking into account purity and ploidy. Coloured areas are estimated 
density distributions, shown for cohorts with at least five cases.c, Detection 


these affecting known cancer-predisposition genes (such as BRCAI, 
BRCA2 and ATM). 


PCAWG tumours with no apparent drivers 


Although more than 90% of PCAWG cases had identified drivers, we 
found none in 181 tumours (Extended Data Fig. 4a). Reasons for miss- 
ing drivers have not yet been systematically evaluated in a pan-cancer 
cohort, and could arise from either technical or biological causes. 
Technical explanations could include poor-quality samples, inad- 
equate sequencing or failures in the bioinformatic algorithms used. 
We assessed the quality of the samples and found that 4 of the 181 
cases with no known drivers had more than 5% tumour DNA contami- 
nation in their matched normal sample (Fig. 3a). Using an algorithm 
designed to correct for this contamination”, we identified previously 
missed mutations in genes relevant to the respective cancer types. 
Similarly, if the fraction of tumour cells in the cancer sample is low 
through stromal contamination, the detection of driver mutations 
can be impaired. Most tumours with no known drivers had an aver- 
age power to detect mutations close to 100%; however, a few had 
power in the 70-90% range (Fig. 3b and Extended Data Fig. 4b). Even 
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sensitivity for TERT promoter hotspots in tumour types in which TERT is 
frequently mutated. Coloured areas are estimated density distributions. 

d, Significant copy-number losses identified by two-sided hypothesis testing 
using GISTIC2.0, corrected for multiple-hypothesis testing. Numbersin 
parentheses indicate the number of genes in significant regions when 
analysing medulloblastomas without known drivers (n= 42). Significant 
regions with known cancer-associated genes are labelled with the 
representative cancer-associated gene. e, Aneuploidy in chromophobe renal 
cell carcinomas and pancreatic neuroendocrine tumours without known 
drivers. Patients are ordered onthe y axis by tumour type and then by presence 
of whole-genome duplication (bottom) or not (top). 


in adequately sequenced genomes, lack of read depth at specific 
driver loci can impair mutation detection. For example, only around 
50% of PCAWG tumours had sufficient coverage to call a mutation 
(=90% power) at the two TERT promoter hotspots, probably because 
the high GC content of this region causes biased coverage (Fig. 3c). 
In fact, 6 hepatocellular carcinomas and 2 biliary cholangiocarcinomas 
among the 181 cases with no known drivers actually did contain TERT 
mutations, which were discovered after deep targeted sequencing”. 

Finally, technical reasons for missing driver mutations include fail- 
ures in the bioinformatic algorithms. This affected 35 myeloprolif- 
erative neoplasms in PCAWG, in which the /AK2”"" driver mutation 
should have been called. Our somatic variant-calling algorithms rely 
on‘panels of normals’, typically from blood samples, to remove recur- 
rent sequencing artefacts. As 2-5% of healthy individuals carry occult 
haematopoietic clones*, recurrent driver mutations in these clones 
can enter panels of normals. 

With regard to biological causes, tumours may be driven by muta- 
tions in cancer-associated genes that are not yet described for that 
tumour type. Using driver discovery algorithms on tumours with no 
known drivers, no individual genes reached significance for point muta- 
tions. However, we identified a recurrent CNA that spanned SETD2 in 
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Fig. 4 | Patterns of clustered mutational processes in PCAWG. a, Kataegis. 
Top, prevalence of different types of kataegis and their association with SVs 
(<1kb from the focus). Bottom, the distribution of the number of foci of 
kataegis per sample. Chromoplexy. Prevalence of chromoplexy across cancer 
types, subdivided into balanced translocations and more complex events. 
Chromothripsis. Top, frequency of chromothripsis across cancer types. 
Bottom, for each cancer type acolumnis shown, in whicheachrowisa 
chromothripsis region represented by five coloured rectangles relating to its 
categorization. b, Circos rainfall plot showing the distances between 
consecutive kataegis events across PCAWG compared with their genomic 
position. Lymphoid tumours (khaki, B cell non-Hodgkin’s lymphoma; orange, 
chronic lymphocytic leukaemia) have hypermutation hot spots (=3 foci with 
distance <1 kb; pale red zone), many of which are near known cancer-associated 
genes (red annotations) and have associated SVs (<10 kb from the focus; shown 
as arcs inthe centre).c, Circos rainfall plot as in b that shows the distance versus 


medulloblastomas that lacked known drivers (Fig. 3d), indicating that 
restricting hypothesis testing to missing-driver cases can improve 
power if undiscovered genes are enriched in such tumours. Inactivation 
of SETD2 in medulloblastoma significantly decreased gene expres- 
sion (P= 0.002) (Extended Data Fig. 4c). Notably, SETD2 mutations 
occurred exclusively in medulloblastoma group-4 tumours (P<1x 10%). 
Group-4 medulloblastomas are known for frequent mutations in other 
chromatin-modifying genes“, and our results suggest that SETD2 loss 
of function is an additional driver that affects chromatin regulators in 
this subgroup. 

Two tumour types hada surprisingly high fraction of patients with- 
out identified driver mutations: chromophobe renal cell carcinoma 
(44%; 19 out of 43) and pancreatic neuroendocrine cancers (22%; 
18 out of 81) (Extended Data Fig. 4a). A notable feature of the miss- 
ing-driver cases in both tumour types was a remarkably consistent 


the position of consecutive chromoplexy and reciprocal translocation 
footprints across PCAWG. Lymphoid, prostate and thyroid cancers exhibit 
recurrent events (>2 footprints with distance <10 kb; pale red zone) that are 
likely to be driver SVs and are annotated with nearby genes and associated SVs, 
which are shownas bold and thin arcs for chromoplexy and reciprocal 
translocations, respectively (colours as ina). d, Effect of chromothripsis along 
the genome and involvement of PCAWG driver genes. Top, number of 
chromothripsis-induced gains or losses (grey) and amplifications (blue) or 
deletions (red). Within the identified chromothripsis regions, selected 
recurrently rearranged (light grey), amplified (blue) and homozygously 
deleted (magenta) driver genes are indicated. Bottom, interbreakpoint 
distance between all subsequent breakpoints within chromothripsis regions 
across cancer types, coloured by cancer type. Regions with an average 
interbreakpoint distance <10 kb are highlighted. C[T>N]T, kataegis witha 
pattern of thymine mutationsinaCp TpT context. 


profile of chromosomal aneuploidy—patterns that have previously 
been reported*“* (Fig. 3e). The absence of other identified driver muta- 
tions in these patients raises the possibility that certain combinations 
of whole-chromosome gains and losses may be sufficient to initiate 
a cancer in the absence of more-targeted driver events such as point 
mutations or fusion genes of focal CNAs. 

Even after accounting for technical issues and novel drivers, 5.3% of 
PCAWG tumours still had no identifiable driver events. In a research 
setting, in which we are interested in drawing conclusions about popu- 
lations of patients, the consequences of technical issues that affect 
occasional samples will be mitigated by sample size. Ina clinical setting, 
in which we are interested in the driver mutations ina specific patient, 
these issues become substantially more important. Careful and critical 
appraisal of the whole pipeline—including sample acquisition, genome 
sequencing, mapping, variant calling and driver annotation, as done 
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here—should be required for laboratories that offer clinical sequenc- 
ing of cancer genomes. 


Patterns of clustered mutations and SVs 


Some somatic mutational processes generate multiple mutations ina 
single catastrophic event, typically clustered in genomic space, leading 
to substantial reconfiguration of the genome. Three such processes 
have previously been described: (1) chromoplexy, in which repair of 
co-occurring double-stranded DNA breaks—typically on different chro- 
mosomes-results in shuffled chains of rearrangements*”*8 (Extended 
Data Fig. 5a); (2) kataegis, a focal hypermutation process that leads to 
locally clustered nucleotide substitutions, biased towards a single DNA 
strand*? >! (Extended Data Fig. 5b); and (3) chromothripsis, in which 
tens to hundreds of DNA breaks occur simultaneously, clustered on 
one or a few chromosomes, with near-random stitching together of 
the resulting fragments? °> (Extended Data Fig. 5c). We characterized 
the PCAWG genomes for these three processes (Fig. 4). 

Chromoplexy events and reciprocal translocations were identified 
in 467 (17.8%) samples (Fig. 4a, c). Chromoplexy was prominent in 
prostate adenocarcinoma and lymphoid malignancies, as previously 
described*’“8, and—unexpectedly—thyroid adenocarcinoma. Differ- 
ent genomic loci were recurrently rearranged by chromoplexy across 
the three tumour types, mediated by positive selection for particu- 
lar fusion genes or enhancer-hijacking events. Of 13 fusion genes or 
enhancer hijacking events in 48 thyroid adenocarcinomas, at least 
4 (31%) were caused by chromoplexy, witha further 4 (31%) part of com- 
plexes that contained chromoplexy footprints (Extended Data Fig. 5a). 
These events generated fusion genes that involved RET (two cases) and 
NTRK3 (one case)*, and the juxtaposition of the oncogene /GF2BP3 
with regulatory elements from highly expressed genes (five cases). 

Kataegis events were found in 60.5% of all cancers, with particularly 
high abundance in lung squamous cell carcinoma, bladder cancer, 
acral melanoma and sarcomas (Fig. 4a, b). Typically, kataegis com- 
prises C > N mutations in a TpC context, which are probably caused 
by APOBEC activity” ™, although a T > N conversion ina TpT or CpT 
process (the affected T is highlighted in bold) attributed to error-prone 
polymerases has recently been described”. The APOBEC signature 
accounted for 81.7% of kataegis events and correlated positively with 
APOBEC3B expression levels, somatic SV burden and age at diagnosis 
(Supplementary Fig. 5). Furthermore, 5.7% of kataegis events involved 
the T > N error-prone polymerase signature and 2.3% of events, most 
notably in sarcomas, showed cytidine deamination in an alternative 
GpC or CpC context. 

Kataegis events were frequently associated with somatic SV break- 
points (Fig. 4a and Supplementary Fig. 6a), as previously described), 
Deletions and complex rearrangements were most-strongly associ- 
ated with kataegis, whereas tandem duplications and other simple 
SV classes were only infrequently associated (Supplementary Fig. 6b). 
Kataegis inducing predominantly T > N mutations in CpTpT context 
was enriched near deletions, specifically those in the 10-25-kilobase 
(kb) range (Supplementary Fig. 6c). 

Samples with extreme kataegis burden (more than 30 foci) comprise 
four types of focal hypermutation (Extended Data Fig. 6): (1) off-target 
somatic hypermutation and foci of T > N at CpTpT, found in B cell non- 
Hodgkin lymphoma and oesophageal adenocarcinomas, respectively; 
(2) APOBEC kataegis associated with complex rearrangements, notably 
found in sarcoma and melanoma; (3) rearrangement-independent 
APOBEC kataegis on the lagging strand and in early-replicating regions, 
mainly found in bladder and head and neck cancer; and (4) a mix of 
the last two types. Kataegis only occasionally led to driver mutations 
(Supplementary Table 5). 

We identified chromothripsis in 587 samples (22.3%), most fre- 
quently among sarcoma, glioblastoma, lung squamous cell carci- 
noma, melanoma and breast adenocarcinoma’®. Chromothripsis 
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increased with whole-genome duplications in most cancer types 
(Extended Data Fig. 7a), as previously shown in medulloblastoma*’. 
The most recurrently associated driver was TP53” (pan-cancer odds 
ratio = 3.22; pan-cancer P= 8.3 x 10°; g< 0.05 in breast lobular (odds 
ratio =13), colorectal (odds ratio = 25), prostate (odds ratio = 2.6) and 
hepatocellular (odds ratio = 3.9) cancers; Fisher-Boschloo tests). In 
two cancer types (osteosarcoma and B cell lymphoma), women hada 
higher incidence of chromothripsis than men (Extended Data Fig. 7b). 
In prostate cancer, we observed a higher incidence of chromothripsis 
in patients with late-onset than early-onset disease” (Extended Data 
Fig. 7c). 

Chromothripsis regions coincided with 3.6% of all identified driv- 
ers in PCAWG and around 7% of copy-number drivers (Fig. 4d). These 
proportions are considerably enriched compared to expectation if 
selection were not acting on these events (Extended Data Fig. 7d). The 
majority of coinciding driver events were amplifications (58%), followed 
by homozygous deletions (34%) and SVs within genes or promoter 
regions (8%). We frequently observed a >2-fold increase or decrease in 
expression of amplified or deleted drivers, respectively, when these loci 
were part of a chromothripsis event, compared with samples without 
chromothripsis (Extended Data Fig. 7e). 

Chromothripsis manifested in diverse patterns and frequencies 
across tumour types, which we categorized on the basis of five charac- 
teristics (Fig. 4a). In liposarcoma, for example, chromothripsis events 
often involved multiple chromosomes, with universal MDM2 ampli- 
fication® and co-amplification of TERT in 4 of 19 cases (Fig. 4d). By 
contrast, in glioblastoma the events tended to affect a smaller region 
onasingle chromosome that was distant from the telomere, resulting 
in focal amplification of FGFR and MDM2 and loss of CDKN2A. Acral 
melanomas frequently exhibited CCND1 amplification, and lung squa- 
mous cell carcinomas SOX2 amplifications. In both cases, these drivers 
were more-frequently altered by chromothripsis compared with other 
drivers inthe same cancer type and to other cancer types for the same 
driver (Fig. 4d and Extended Data Fig. 7f). Finally, in chromophobe renal 
cell carcinoma, chromothripsis nearly always affected chromosome 
5 (Supplementary Fig. 7): these samples had breakpoints immediately 
adjacent to TERT, increasing TERT expression by 80-fold on average 
compared with samples without rearrangements (P= 0.0004; Mann- 
Whitney U-test). 


Timing clustered mutations in evolution 


Anunanswered question for clustered mutational processes is whether 
they occur early or late in cancer evolution. To address this, we used 
molecular clocks to define broad epochs in the life history of each 
tumour**™, One transition point is between clonal and subclonal muta- 
tions: clonal mutations occurred before, and subclonal mutations after, 
the emergence of the most-recent common ancestor. In regions with 
copy-number gains, molecular time can be further divided according 
to whether mutations preceded the copy-number gain (and were them- 
selves duplicated) or occurred after the gain (and therefore present on 
only one chromosomal copy)’. 

Chromothripsis tended to have greater relative odds of being clonal 
than subclonal, suggesting that it occurs early in cancer evolution, 
especially in liposarcomas, prostate adenocarcinoma and squamous 
celllung cancer (Fig. 5a). As previously reported, chromothripsis was 
especially common in melanomas™. We identified 89 separate chromo- 
thripsis events that affected 66 melanomas (61%); 47 out of 89 events 
affected genes known to be recurrently altered in melanoma® (Sup- 
plementary Table 6). Involvement of a region on chromosome 11 that 
includes the cell-cycle regulator CCND1 occurred in 21 cases (10 out 
of 86 cutaneous, and 11 out of 21 acral or mucosal melanomas), typi- 
cally combining chromothripsis with amplification (19 out of 21 cases) 
(Extended Data Fig. 8). Co-involvement of other cancer-associated 
genes in the same chromothripsis event was also frequent, including 
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Fig. 5| Timing of clustered events in PCAWG. a, Extent and timing of 
chromothripsis, kataegis and chromoplexy across PCAWG. Top, stacked bar 
charts illustrate co-occurrence of chromothripsis, kataegis and chromoplexy 
inthe samples. Middle, relative odds of clustered events being clonal or 
subclonal are shown with bootstrapped 95% confidence intervals. Point 
estimates are highlighted when they do not overlap odds of 1:1. Bottom, 
relative odds of the events being early or late clonal are shown as above. Sample 


TERT (five cases), CDKN2A (three cases), TP53 (two cases) and MYC 
(two cases) (Fig. 5b). In these co-amplifications, a chromothripsis 
event involving multiple chromosomes initiated the process, creat- 
ing a derivative chromosome in which hundreds of fragments were 
stitched together in a near-random order (Fig. 5b). This derivative 
then rearranged further, leading to massive co-amplification of the 
multiple target oncogenes together with regions located nearby on 
the derivative chromosome. 

In these cases of amplified chromothripsis, we can use the inferred 
number of copies bearing each SNV to time the amplification process. 
SNVs present on the chromosome before amplification will them- 
selves be amplified and are therefore reported in a high fraction of 
sequence reads (Fig. 5b and Extended Data Fig. 8). By contrast, late 
SNVs that occur after the amplification has concluded will be present 
ononly one chromosome copy out of many, and thus have alow variant 
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sizes (number of patients) are shownacross the top. b, Three representative 
patients with acral melanomaand chromothripsis-induced amplification that 
simultaneously affects TERT and CCNDI. The black points (top) represent 
sequence coverage from individual genomic bins, with SVs shown as coloured 
arcs (translocation in black, deletion in purple, duplication in brown, tail-to-tail 
inversionincyanand head-to-head inversion in green). Bottom, the variant 
allele fractions of somatic point mutations. 


allele fraction. Regions of CCND1 amplification had few—sometimes 
zero—mutations at high variant allele fraction in acral melanomas, in 
contrast to later CCND1 amplifications in cutaneous melanomas, in 
which hundreds to thousands of mutations typically predated ampli- 
fication (Fig. 5b and Extended Data Fig. 9a, b). Thus, both chromoth- 
ripsis and the subsequent amplification generally occurred very early 
during the evolution of acral melanoma. By comparison, in lung squa- 
mous cell carcinomas, similar patterns of chromothripsis followed by 
SOX2 amplification are characterized by many amplified SNVs, sug- 
gesting a later event in the evolution of these cancers (Extended Data 
Fig. 9c). 

Notably, in cancer types in which the mutational load was sufficiently 
high, we could detect a larger-than-expected number of SNVs on an 
intermediate number of DNA copies, suggesting that they appeared 
during the amplification process (Supplementary Fig. 8). 
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Fig. 6 |Germline determinants of the somatic mutation landscape. 

a, Association between common (MAF > 5%) germline variants and somatic 
APOBEC3B-like mutagenesis in individuals of European ancestry (n=1,201). 
Two-sided hypothesis testing was performed with PLINK v.1.9. To mitigate 
multiple-hypothesis testing, the significance threshold was set to genome- 
wide significance (P<5 x10°°). b, Templated insertion SVs in a BRCAI- 
associated prostate cancer. Left, chromosome bands (1); SVs <10 megabases 
(Mb) (2);1-kb read depth corrected to copy number 0-6 (3); inter- and 
intrachromosomal SVs > 10 Mb (4). Right, acomplex somatic SV composed of a 
2.2-kb tandem duplication on chromosome 2 together with a 232-base-pair 
(bp) inverted templated insertion SV that is derived from chromosome 5 and 
inserted inbetween the tandem duplication (bottom). Consensus sequence 
alignment of locally assembled Oxford Nanopore Technologies long 
sequencing reads to chromosomes 2 and 5 of the human reference genome 
(top). Breakpoints are circled and marked as 1 (beginning of tandem 
duplication), 2 (end of tandem duplication) or 3 (inverted templated insertion). 
For each breakpoint, the middle panel shows Illumina short reads at SV 


Germline effects on somatic mutations 


We integrated the set of 88 million germline genetic variant calls 
with somatic mutations in PCAWG, to study germline determinants 
of somatic mutation rates and patterns. First, we performed a genome- 
wide association study of somatic mutational processes with common 
germline variants (minor allele frequency (MAF) > 5%) in individuals 
with inferred European ancestry. Anindependent genome-wide associ- 
ation study was performed in East Asian individuals from Asian cancer 
genome projects. We focused on two prevalent endogenous muta- 
tional processes: spontaneous deamination of 5-methylcytosine at 
CpG dinucleotides (signature 1) and activity of the APOBEC3 family of 
cytidine deaminases™ (signatures 2 and 13). No locus reached genome- 
wide significance (P<5 x 10°°) for signature 1 (Extended Data Fig. 10a, 
b). However, alocus at 22q13.1 predicted an APOBEC3B-like mutagen- 
esis at the pan-cancer level® (Fig. 6a). The strongest signal at 22q13.1 
was driven by rs12628403, and the minor (non-reference) allele was 
protective against APOBEC3B-like mutagenesis (B =-0.43, P=5.6 x10°, 
MAF = 8.2%, n = 1,201 donors) (Extended Data Fig. 10c). This variant 
tags acommon, approximately 30-kb germline SV that deletes the 
APOBEC3B coding sequence and fuses the APOBEC3B 3’ untranslated 
region with the coding sequence of APOBEC3A. The deletion is known 
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breakpoints. c, Association between rare germline PT Vs (MAF < 0.5%) and 
somatic CpG mutagenesis (approximately with signature 1) in individuals of 
European ancestry (n=1,201). Genes highlighted in blue or red were associated 
with lower or higher somatic mutation rates. Two-sided hypothesis testing was 
performed using linear-regression models with sex, age at diagnosis and 
cancer project as variables. To mitigate multiple-hypothesis testing, the 
significance threshold was set to exome-wide significance (P< 2.5 x10). 

The black line represents the identity line that would be followed if the 
observed Pvalues followed the null expectation; the shaded area shows 

the 95% confidence intervals. d, Catalogue of polymorphic germline L1 source 
elements that are active in cancer. The chromosomal map shows germline 
source Llelements as volcano symbols. Each volcano is colour-coded 
according to the type of source L1 activity. The contribution of each source 
locus (expressed as a percentage) to the total number of transductions 
identified in PCAWG tumoursis represented as a gradient of volcano size, with 
top contributing elements exhibiting larger sizes. 


to increase breast cancer risk and APOBEC mutagenesis in breast can- 
cer genomes”, Here, we found that rs126284.03 reduces APOBEC3B- 
like mutagenesis specifically in cancer types with low levels of APOBEC 
mutagenesis (B,,y = —0.50, Piow =1* 108; Brigh = 0.17, Prigh = 0.2), and 
increases APOBEC3A-like mutagenesis in cancer types with high lev- 
els of APOBEC mutagenesis (Bpigh = 0.44, Phish = 8 X 107*; Bio = —O.21, 
Pry = 0.02). Moreover, we identified a second, novel locus at 22q13.1 
that was associated with APOBEC3B-like mutagenesis across cancer 
types (rs2142833, B= 0.23, P=1.3 x10°8). Weindependently validated the 
association between both loci and APOBEC3B-like mutagenesis using 
East Asian individuals from Asian cancer genome projects 
(B,s12628403 = 0.57, Prs12628403 = 4-2 * 10°”; B,s2142833 = 0.58, Prsa142g33 = 8 * 10") 
(Extended Data Fig. 10d). Notably, in a conditional analysis that 
accounted for rs12628403, we found that rs2142833 and rs12628403 
are inherited independently in Europeans (7°<0.1), and rs2142833 
remained significantly associated with APOBEC3B-like mutagenesis 
in Europeans (Beyp = 0.17, Peyg = 3 * 10°) and East Asians (Basy = 0.25, 
Pysy= 2 * 10°) (Extended Data Fig. 10e, f). Analysis of donor-matched 
expression data further suggests that rs2142833 is a cis-expression 
quantitative trait locus (eQTL) for APOBEC3B at the pan-cancer level 
(B = 0.19, P=2 x 10°) (Extended Data Fig. 10g, h), consistent with 
cis-eQTL studies in normal cells®*. 
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Fig. 7 | Telomere sequence patterns across PCAWG.a, Scatter plot of the 
clusters of telomere patterns identified across PCAWG using ¢-distributed 
stochastic neighbour embedding (t-SNE), based onn=2,518 tumour samples 
and their matched normal samples. Axes have arbitrary dimensions such that 
samples with similar telomere profiles are clustered together and samples with 
dissimilar telomere profiles are far apart with high probability. b, Distribution 
of the four tumour-specific clusters of telomere patterns in selected tumour 
types from PCAWG. c, Distribution of relevant driver mutations associated 
with alternative lengthening of telomere and normal telomere maintenance 
across the four clusters. d, Distribution of telomere maintenance 
abnormalities across tumour types with more than 40 patients in PCAWG. 
Samples were classified as tumour clusters 1-3 if they fell into a relevant cluster 
without mutations in TERT, ATRX or DAXX and had no ALT phenotype. TMM, 
telomere maintenance mechanisms. 


Second, we performed a rare-variant association study (MAF <0.5%) 
to investigate the relationship between germline PTVs and somatic 
DNA rearrangements in individuals with European ancestry (Extended 
Data Fig. 1la-—c). Germline BRCA2 and BRCA1 PTVs were associated 
with an increased burden of small (less than 10 kb) somatic SV dele- 
tions (P=1x10°°) and tandem duplications (P= 6 x 10%), respectively, 
corroborating recent studies in breast and ovarian cancer”. In 
PCAWG data, this pattern also extends to other tumour types, includ- 
ing adenocarcinomas of the prostate and pancreas‘, typically in the 
setting of biallelic inactivation. In addition, tumours with high lev- 
els of small SV tandem duplications frequently exhibited a novel and 
distinct class of SVs termed ‘cycles of templated insertions*. These 
complex SV events consist of DNA templates that are copied from 
across the genome, joined into one contiguous sequence and inserted 
into a single derivative chromosome. We found a significant associa- 
tion between germline BRCAI PTVs and templated insertions at the 
pan-cancer level (P=4 x 10™) (Extended Data Fig. 11d, e). Whole-genome 


long-read sequencing data generated for a BRCAI-deficient PCAWG 
prostate tumour verified the small tandem-duplication and templated- 
insertion SV phenotypes (Fig. 6b). Almost all (20 out of 21) of BRCA1- 
associated tumours with atemplated-insertion SV phenotype displayed 
combined germline and somatic hits in the gene. Together, these data 
suggest that biallelic inactivation of BRCA1is a driver of the templated- 
insertion SV phenotype. 

Third, rare-variant association analysis revealed that patients with 
germline MBD4 PTVs had increased rates of somatic C > T mutation 
rates at CpG dinucleotides (P< 2.5 x 10°) (Fig. 6c and Extended Data 
Fig. 11f, g). Analysis of previously published whole-exome sequencing 
samples from the TCGA (n=8,134) replicated the association between 
germline MBD4 PTVs and increased somatic CpG mutagenesis at the 
pan-cancer level (P= 7.1 x 10*) (Extended Data Fig. 11h). Moreover, 
gene-expression profiling revealed a significant but modest correlation 
between MBD4 expression and somatic CpG mutation rates between 
and within PCAWG tumour types (Extended Data Fig. 1li-k). MBD4 
encodes a DNA-repair gene that removes thymidines from T:G mis- 
matches within methylated CpG sites”, a functionality that would be 
consistent with a CpG mutational signature in cancer. 

Fourth, we assessed long interspersed nuclear elements (LINE-1; L1 
hereafter) that mediate somatic retrotransposition events” “. We iden- 
tified 114 germline source L1 elements capable of active somatic retro- 
transposition, including 70 that represent insertions with respect to the 
human reference genome (Fig. 6d and Supplementary Table 7), and 53 
that were tagged by single-nucleotide polymorphisms in strong linkage 
disequilibrium (Supplementary Table 7). Only 16 germline L1 elements 
accounted for 67% (2,440 out of 3,669) of all LI1-mediated transduc- 
tions’? detected in the PCAWG dataset (Extended Data Fig. 12a). These 
16 hot-L1 elements followed two broad patterns of somatic activity (8 
of each), which we term Strombolian and Plinian in analogy to patterns 
of volcanic activity. Strombolian L1s are frequently active in cancer, 
but mediate only small-to-modest eruptions of somatic L1 activity in 
cancer samples (Extended Data Fig. 12b). By contrast, Plinian L1s are 
more rarely seen, but display aggressive somatic activity. Whereas 
Strombolian elements are typically relatively common (MAF > 2%) and 
sometimes even fixed in the human population, all Plinian elements 
were infrequent (MAF < 2%) in PCAWG donors (Extended Data Fig. 12c; 
P=0.001, Mann-Whitney U-test). This dichotomous pattern of activ- 
ity and allele frequency may reflect differences in age and selective 
pressures, with Plinian elements potentially inserted into the human 
germline more recently. PCAWG donors bear on average between 50 
and 60 L1 source elements and between 5 and 7 elements with hot 
activity (Extended Data Fig. 12d), but only 38% (1,075 out of 2,814) of 
PCAWG donors carried =1 Plinian element. Some L1 germline source 
loci caused somatic loss of tumour-suppressor genes (Extended Data 
Fig. 12e). Many are restricted to individual continental population 
ancestries (Extended Data Fig. 12f-)). 


Replicative immortality 


One of the hallmarks of cancer is the ability of cancer to evade cellular 
senescence”. Normal somatic cells typically have finite cell division 
potential; telomere attrition is one mechanism to limit numbers of 
mitoses”. Cancers enlist multiple strategies to achieve replicative 
immortality. Overexpression of the telomerase gene, TERT, which main- 
tains telomere lengths, is especially prevalent. This can be achieved 
through point mutations in the promoter that lead to de novo tran- 
scription factor binding**”’; hitching TERT to highly active regulatory 
elements elsewhere in the genome*°”; insertions of viral enhancers 
upstream of the gene”’”; and increased dosage through chromosomal 
amplification, as we have seen in melanoma (Fig. 5b). In addition, there is 
an ‘alternative lengthening of telomeres’ (ALT) pathway, in which telom- 
eres are lengthened through homologous recombination, mediated by 
loss-of-function mutations in the ATRX and DAXX genes”. 
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As reported ina companion paper”, 16% of tumours in the PCAWG 
dataset exhibited somatic mutations in at least one of ATRX, DAXX 
and TERT. TERT alterations were detected in 270 samples, whereas 
128 tumours had alterations in ATRX or DAXX, of which 71 were protein- 
truncating. Inthe companion paper, which focused on describing pat- 
terns of ALT and TERT-mediated telomere maintenance”, 12 features 
of telomeric sequence were measured in the PCAWG cohort. These 
included counts of nine variants of the core hexameric sequence, 
the number of ectopic telomere-like insertions within the genome, 
the number of genomic breakpoints and telomere length as a ratio 
between tumour and normal. Here we used the 12 features as an over- 
view of telomere integrity across all tumours in the PCAWG dataset. 

On the basis of these 12 features, tumour samples formed 4 dis- 
tinct subclusters (Fig. 7a and Extended Data Fig. 13a), suggesting that 
telomere-maintenance mechanisms are more diverse than the well- 
established TERT and ALT dichotomy. Clusters C1 (47 tumours) and 
C2 (42 tumours) were enriched for traits of the ALT pathway—having 
longer telomeres, more genomic breakpoints, more ectopic telomere 
insertions and variant telomere sequence motifs (Supplementary 
Fig. 9). Cland C2 were distinguished from one another by the latter 
having a considerable increase in the number of TTCGGG and TGAGGG 
variant motifs among the telomeric hexamers. Thyroid adenocarci- 
nomas were markedly enriched among C3 samples (26 out of 33 C3 
samples; P<10™°); the Cl cluster (ALT subtype 1) was common among 
sarcomas; and both pancreatic endocrine neoplasms and low-grade 
gliomas had a high proportion of samples in the C2 cluster (ALT sub- 
type 2) (Fig. 7b). Notably, some of the thyroid adenocarcinomas and 
pancreatic neuroendocrine tumours that cluster together (cluster C3) 
had matched normal samples that also cluster together (normal cluster 
N3) (Extended Data Fig. 13a) and which share common properties. For 
example, the GIAGGG repeat was overrepresented among samplesin 
this group (Supplementary Fig. 10). 

Somatic driver mutations were also unevenly distributed across the 
four clusters (Fig. 7c).C1 tumours were enriched for RBI mutations or 
SVs (P=3 x10), as well as frequent SVs that affected ATRX (P=6 x10), 
but not DAXX. RB1 and ATRX mutations were largely mutually exclusive 
(Extended Data Fig. 13b). By contrast, C2 tumours were enriched for 
somatic point mutations in ATRX and DAXX (P=6 x 10°), but not RBI. 
The enrichment of RBI mutations in C1 remained significant when 
only leiomyosarcomas and osteosarcomas were considered, confirm- 
ing that this enrichment is not merely a consequence of the different 
distribution of tumour types across clusters. C3 samples had frequent 
TERT promoter mutations (30%; P=2 10°). 

There was a marked predominance of RBI mutations in C1. Nearly 
a third of the samples in Cl contained an RB/ alteration, which were 
evenly distributed across truncating SNVs, SVs and shallow dele- 
tions (Extended Data Fig. 13c). Previous research has shown that RBI 
mutations are associated with long telomeres in the absence of TERT 
mutations and ATRX inactivation®, and studies using mouse models 
have shown that knockout of Rb-family proteins causes elongated 
telomeres®. The association with the Cl cluster here suggests that RB1 
mutations can represent another route to activating the ALT pathway, 
which has subtly different properties of telomeric sequence com- 
pared with the inactivation of DAXX—these fall almost exclusively in 
cluster C2. 

Tumour types with the highest rates of abnormal telomere mainte- 
nance mechanisms often originate in tissues that have low endogenous 
replicative activity (Fig. 7d). In support of this, we found an inverse cor- 
relation between previously estimated rates of stem cell division across 
tissues® and the frequency of telomere maintenance abnormalities 
(P=0.01, Poisson regression) (Extended Data Fig. 13d). This suggests 
that restriction of telomere maintenance is an important tumour- 
suppression mechanism, particularly in tissues with low steady-state 
cellular proliferation, in which aclone must overcome this constraint 
to achieve replicative immortality. 
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Conclusions and future perspectives 


The resource reported in this paper and its companion papers has 
yielded insights into the nature and timing of the many mutational 
processes that shape large- and small-scale somatic variation in the 
cancer genome; the patterns of selection that act on these varia- 
tions; the widespread effect of somatic variants on transcription; 
the complementary roles of the coding and non-coding genome for 
both germline and somatic mutations; the ubiquity of intratumoral 
heterogeneity; and the distinctive evolutionary trajectory of each 
cancer type. Many of these insights can be obtained only from an 
integrated analysis of all classes of somatic mutation on a whole- 
genome scale, and would not be accessible with, for example, targeted 
exome sequencing. 

The promise of precision medicine is to match patients to targeted 
therapies using genomics. A major barrier toits evidence-based imple- 
mentation is the daunting heterogeneity of cancer chronicled in these 
papers, from tumour type totumour type, from patient to patient, from 
clonetocloneand from cell to cell. Building meaningful clinical predic- 
tors from genomic data can be achieved, but will require knowledge 
banks comprising tens of thousands of patients with comprehensive 
clinical characterization®. As these sample sizes will be too large for 
any single funding agency, pharmaceutical company or healthsystem, 
international collaboration and data sharing will be required. The next 
phase of ICGC, ICGC-ARGO (https:// www.icgc-argo.org/), will bring 
the cancer genomics community together with healthcare providers, 
pharmaceutical companies, data science and clinical trials groups to 
build comprehensive knowledge banks of clinical outcome and treat- 
ment data from patients with a wide variety of cancers, matched with 
detailed molecular profiling. 

Extending the story begun by TCGA, ICGC and other cancer genom- 
ics projects, the PCAWG has brought us closer to a comprehensive 
narrative of the causal biological changes that drive cancer phenotypes. 
We must now translate this knowledge into sustainable, meaningful 
clinical treatments. 
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Methods 


Samples 

We compiled an inventory of matched tumour-normal whole-cancer 
genomes in the ICGC Data Coordinating Centre. Most samples came 
from treatment-naive, primary cancers, although a small number of 
donors had multiple samples of primary, metastatic and/or recurrent 
tumours. Our inclusion criteria were: (1) matched tumour and normal 
specimen pair; (2) a minimal set of clinical fields; and (3) characteri- 
zation of tumour and normal whole genomes using IIlumina HiSeq 
paired-end sequencing reads. 

We collected genome data from 2,834 donors, representing all ICGC 
and TCGA donors that met these criteria at the time of the final data 
freeze in autumn 2014 (Extended Data Table 1). After quality assurance 
(Supplementary Methods 2.5), data from 176 donors were excluded 
as unusable, 75 had minor issues that could affect some analyses 
(grey-listed donors) and 2,583 had data of optimal quality (white-listed 
donors) (Supplementary Table 1). Across the 2,658 white- and grey- 
listed donors, whole-genome sequences were available from 2,605 
primary tumours and 173 metastases or local recurrences. Matching 
normal samples were obtained from blood (2,064 donors), tissue 
adjacent to the primary tumour (87 donors) or from distant sites (507 
donors). Whole-genome sequencing data were available for tumour 
and normal DNA for the entire cohort. The mean read coverage was 
39x for normal samples, whereas tumours had a bimodal coverage 
distribution with modes at 38x and 60x (Supplementary Fig. 1). The 
majority of specimens (65.3%) were sequenced using 101-bp paired- 
end reads. An additional 28% were sequenced with 100-bp paired-end 
reads. Of the remaining specimens, 4.7% were sequenced with read 
lengths longer than 101 bp, and 1.9% with read lengths shorter than 
100 bp. The distribution of read lengths by tumour cohort is shownin 
Supplementary Fig. 11. Median read length for whole-genome 
sequencing paired-end reads was 101 bp (mean = 106.2, s.d. = 16.7; 
minimum-maximum = 50-151). RNA-sequencing data were collected 
and re-analysed centrally for 1,222 donors, including 1,178 primary 
tumours, 67 metastases or local recurrences and 153 matched normal 
tissue samples adjacent to the primary tumour. 

Demographically, the cohort included 1,469 men (55%) and 
1,189 women (45%), with a mean age of 56 years (range, 1-90 years) 
(Supplementary Table 1). Using population ancestry-differentiated 
single nucleotide polymorphisms, the ancestry distribution was heavily 
weighted towards donors of European descent (77% of total) followed 
by East Asians (16%), as expected for large contributions from European, 
North American and Australian projects (Supplementary Table 1). 

We consolidated histopathology descriptions of the tumour sam- 
ples, using the ICD-0-3 tumour site controlled vocabulary®. Overall, 
the PCAWG dataset comprises 38 distinct tumour types (Extended 
Data Table 1and Supplementary Table 1). Although the most common 
tumour types are included in the dataset, their distribution does not 
match the relative population incidences, largely owing to differences 
among contributing ICGC/TCGA groups in the numbers of sequenced 
samples. 


Uniform processing and somatic variant calling 

To generate a consistent set of somatic mutation calls that could be 
used for cross-tumour analyses, we analysed all 6,835 samples using a 
uniform set of algorithms for alignment, variant calling and quality con- 
trol (Extended Data Fig. 1, Supplementary Fig. 2, Supplementary Table 3 
and Supplementary Methods 2). We used the BWA-MEM algorithm” 
to align each tumour and normal sample to human reference build 
hs37d5 (as used in the 1000 Genomes Project”). Somatic mutations 
were identified in the aligned data using three established pipelines, 
which were run independently on each tumour-normal pair. Each of the 
three pipelines—labelled ‘Sanger’”” ®, ‘EMBL/DKFZ’°” and ‘Broad’> 
after the computational biology groups that created or assembled 


them—consisted of multiple software packages for calling somatic 
SNVs, smallindels, CNAs and somatic SVs (with intrachromosomal SVs 
defined as those >100 bp). Two additional variant algorithms!" were 
included to further improve accuracy across a broad range of clonal 
and subclonal mutations. We tested different merging strategies using 
validation data, and choses the optimal method for each variant type 
to generate a final consensus set of mutation calls (Supplementary 
Methods 82.4). 

Somatic retrotransposition events, including Alu and LINE-1 inser- 
tions”, L1-mediated transductions” and pseudogene formation’, were 
called using a dedicated pipeline”. We removed these retrotransposi- 
tion events from the somatic SV call-set. Mitochondrial DNA mutations 
were called using a published algorithm’. RNA-sequencing data were 
uniformly processed to quantify normalized gene-level expression, 
splicing variation and allele-specific expression, and to identify fusion 
transcripts, alternative promoter usage and sites of RNA editing®. 


Integration, phasing and validation of germline variant call-sets 
Calls of common (21% frequency in PCAWG) and rare (<1%) germline 
variants including single-nucleotide polymorphisms, indels, SVs and 
mobile-element insertions (MEIs) were generated using a population- 
scale genetic polymorphism-detection approach”. The uniform 
germline data-processing workflow comprised variant identification 
using six different variant-calling algorithms’°”"’ and was orches- 
trated using the Butler workflow system”. 

We performed call-set benchmarking, merging, variant genotyp- 
ing and statistical haplotype-block phasing”! (Supplementary Meth- 
ods 3.4). Using this strategy, we identified 80.1 million germline 
single-nucleotide polymorphisms, 5.9 million germline indels, 1.8 mil- 
lion multi-allelic short (<50 bp) germline variants, as well as germline 
SVs => 50 bp in size including 29,492 biallelic deletions and 27,254 MEIs 
(Supplementary Table 2). We statistically phased this germline variant 
set using haplotypes from the 1000 Genomes Project”! as a reference 
panel, yielding an N50-phased block length of 265 kb based on haploid 
chromosomes from donor-matched tumour genomes. Precision esti- 
mates for germline SNVs and indels were >99% for the phased merged 
call-set, and sensitivity estimates ranged from 92% to 98%. 


Core alignment and variant calling by cloud computing 

The requirement to uniformly realign and call variants on nearly 5,800 
whole genomes (tumour plus normal) presented considerable com- 
putational challenges, and raised ethical issues owing to the use of 
data from different jurisdictions (Extended Data Table 2). To process 
the data, we adopted a cloud-computing architecture” in which the 
alignment and variant calling was spread across 13 data centres on3 
continents, representing a mixture of commercial, infrastructure-as- 
a-service, academic cloud compute and traditional academic high- 
performance computer clusters (Supplementary Table 3). Together, 
the effort used 10 million CPU-core hours. 

To generate reproducible variant calling across the 13 data centres, 
we built the core pipelines into Docker containers”, in which the work- 
flow description, required code and all associated dependencies were 
packaged together in stand-alone packages. These heavily tested, exten- 
sively validated workflows are available for download (Box 1). 


Validation, benchmarking and merging of somatic variant calls 

To evaluate the performance of each of the mutation-calling pipelines 
and determine an integration strategy, we performed a large-scale 
deep-sequencing validation experiment (Supplementary Notes 1). We 
selected a pilot set of 63 representative tumour-normal pairs, on which 
weranthe3core pipelines, together with a set of 10 additional somatic 
variant-calling pipelines contributed by members of the PCAWG SNV 
Calling Methods Working Group. Sufficient DNA remained for 50 of 
the 63 cases for validation, which was performed by hybridization of 
tumour and matched normal DNA to acustom RNA bait set, followed 
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by deep sequencing, as previously described”’. Although performed 
using the same sequencing chemistry as the original whole-genome 
sequencing analyses, the considerably greater depth achieved in the 
validation experiment enabled accurate assessment of sensitivity and 
precision of variant calls. Variant calls in repeat-masked regions were 
not tested, owing to the challenge of designing reliable validation 
probes in these areas. 

The3 core pipelines had individual estimates of sensitivity of 80-90% 
to detect atrue somatic SNV called by any of the 13 pipelines; with >95% 
of SNV calls made by each of the core pipelines being genuine somatic 
variants (Fig. 1a). For indels—a more-challenging class of variants to 
identify in short-read sequencing data—the 3 core algorithms had indi- 
vidual sensitivity estimates in the range of 40-50%, with precision 
70-95% (Fig. 1b). Validation of SV calls is inherently more difficult, 
as methods based on PCR or hybridization to RNA baits often fail to 
isolate DNA that spans the breakpoint. To assess the accuracy of SV 
calls, we therefore used the property that an SV must either generate 
a copy-number change or be balanced, whereas artefactual calls will 
not respect this property. For individual SV-calling algorithms, we 
estimated precision to be in the range of 80-95% for samples in the 
63-sample pilot dataset. 

Next, we examined multiple methods for merging calls made by 
several algorithms into a single definitive call-set to be used for down- 
stream analysis. The final consensus calls for SNVs were based onasim- 
ple approach that required two or more methods to agree on acall. For 
indels, because methods were less concordant, we used stacked logistic 
regression” to integrate the calls. The merged SV set includes all calls 
made bytwoor more of the four primary SV-calling algorithms’°1001?2, 
Consensus CNA calls were obtained by joining the outputs of six indi- 
vidual CNA-calling algorithms with SV consensus breakpoints to obtain 
base-pair resolution CNAs (Supplementary Methods 2.4.3). Consensus 
purity and ploidy were derived, and a multitier system was developed 
for consensus copy-number calls (Supplementary Methods 2.4.3, and 
described in detail elsewhere’). 

Overall, the sensitivity and precision of the consensus somatic vari- 
ant calls were 95% (90% confidence interval, 88-98%) and 95% (90% 
confidence interval, 71-99%), respectively, for SNVs (Extended Data 
Fig. 2). For somatic indels, sensitivity and precision were 60% (90% con- 
fidence interval, 34-72%) and 91% (90% confidence interval, 73-96%), 
respectively. Regarding SVs, we estimate the sensitivity of the merging 
algorithm to be 90% for true calls generated by any one calling pipeline; 
precision was estimated to be 97.5%. That is, 97.5% of SVs in the merged 
SV call-set had an associated copy-number change or balanced partner 
rearrangement. The improvement in calling accuracy from combining 
different pipelines was most noticeable in variants that had low variant 
allele fractions, which are likely to originate from subclonal popula- 
tions of the tumour (Fig. 1c, d). There remains much work to be done 
to improve indel calling software; we still lack sensitivity for calling 
even fully clonal complex indels from short-read sequencing data. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The PCAWG-generated alignments, somatic variant calls, annotations 
and derived datasets are available for general research use for browsing 
and download at http://dcc.icgc.org/pcawg/ (Box 1and Supplementary 
Table 4). In accordance with the data access policies of the ICGC and 
TCGA projects, most molecular, clinical and specimen data are in an 
open tier which does not require access approval. To access poten- 
tially identifying information, such as germline alleles and underly- 
ing read data, researchers will need to apply to the TCGA Data Access 
Committee (DAC) via dbGaP (https://dbgap.ncbi.nlm.nih.gov/aa/wga. 


cgi?page=login) for access to the TCGA portion of the dataset, and to 
the ICGC Data Access Compliance Office (DACO; http://icgc.org/daco) 
for the ICGC portion. In addition, to access somatic single nucleotide 
variants derived from TCGA donors, researchers will also need to obtain 
dbGaP authorization. 

Beyond the core sequence data and variant call-sets, the analyses in 
this paper used anumber of datasets that were derived from the variant 
calls (Supplementary Table 4). The individual datasets are available at 
Synapse (https://www.synapse.org/), and are denoted with synXXXXX 
accession numbers; all these datasets are also mirrored at https://dcc. 
icgc.org, with full links, filenames, accession numbers and descriptions 
detailed in Supplementary Table 4. The datasets encompass: clinical 
data fromeach patient including demographics, tumour stage and vital 
status (syn10389158); harmonized tumour histopathology annotations 
using a standardised hierarchical ontology (syn1038916); inferred 
purity and ploidy values for each tumour sample (syn8272483); driver 
mutations for each patient from their cancer genome spanning all 
classes of variant, and coding versus non-coding drivers (syn11639581); 
mutational signatures inferred from PCAWG donors (syn11804065), 
including APOBEC mutagenesis (syn7437313); and transcriptional data 
from RNA sequencing, including gene expression levels (syn5553985, 
syn5553991, syn8105922) and gene fusions (synl10003873, syn7221157). 


Code availability 


Computational pipelines for calling somatic mutations are available to 
the public at https://dockstore.org/organizations/PCAWG/collections/ 
PCAWG. A range of data-visualization and -exploration tools are also 
available for the PCAWG data (Box 1). 
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Extended Data Fig. 1| Flow-chart showing key steps in the analysis of PCAWG 
genomes. After alignment to the genome, somatic mutations were identified 
by three pipelines, with subsequent merging into a consensus variant set used 


for downstream scientific analyses. Subs, substitutions; DKFZ/EMBL, the 
German Cancer Research Centre (DKFZ) and Europen Molecular Biology 
Laboratory (EMBL). 
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Extended Data Fig. 2 | Distribution of accuracy estimates across algorithms dataset. b, F, accuracy, precision and sensitivity estimates for somatic indels 
and samples from validation data. a, F, accuracy, precision and sensitivity (n=50 samples). SVM, support vector machine; union, calls made by all variant- 
estimates for somatic SNVs across the core algorithms and different calling algorithms; intersect2, calls made by any combination of two variant- 
approaches to merging the call-sets. The box plots demarcatetheinterquartile calling algorithms; intersect3, calls made by any three variant-calling 

range and median of estimates across then=50 samples in the validation algorithms. 
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indicate substitutions; indels are taken as insertions or deletions <100 bp in 


Extended Data Fig. 3 | Distribution of numbers of somatic mutations of 


size; retrotranspositions are the combined counts of somatic retrotransposon 


insertions, transductions and somatic pseudogene insertions. 


different classes across tumour types. They axis is onalog scale. The 2,583 


donors with the highest quality metrics (white-listed donors) are plotted. SNVs 
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Extended Data Fig. 4| Patients with no detected driver mutationsinPCAWG. sensitivity values for that tumour type (the width proportional is to density). 


a, Number (red) of patients without detected driver mutations distributed c, SETD2 expression levels across different medulloblastoma subtypes. Points 
across the different tumour types studied. b, Estimated sensitivity for represent individual patients, coloured by whether the gene exhibited focal 
detecting somatic point mutations genome-wide across tumour types (total copy number (CN) loss or a truncating point mutation, or was the wild-type 


sample size: n=2,583 patients). Each point represents the estimate for a single gene. The coloured areas are violin plots showing the estimated density 
patient, layered on violin plots that show the estimated density distribution of distribution of expression values for that medulloblastoma subtype. 
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Extended Data Fig. 5 | See next page for caption. 
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Extended Data Fig. 5| Examples of clustered mutational processes. 

a, Chromoplexy example ina thyroid adenocarcinoma. Genes at the 
breakpoints are schematically depicted in their normal genomic context and 
again inthe reconstructed derivative chromosomes below. b, Distinct kataegis 
signatures in the genome of a pancreatic adenocarcinoma sample. SVs and 
their classification are shown above the main rainfall plot, as well as the total 
and minor allele copy number. Tra, translocation; del, deletion; dup, 
duplication; t2tInv, tail-to-tail inversion; h2hInv, head-to-head inversion. 
Magnifications of the three focion chromosomes 1, 8 and 12, respectively, 
highlight distinct manifestations of kataegis. Left, anovel process similar to 
signature 17 with T > N mutations at CT or TT dinucleotides. Middle, the 


prototypical APOBEC3A/B type withC > T (signature 2) and/or C>G/A 
(signature 13) substitutions at TpC. Right, an alternative cytidine deaminase(s) 
witha preference for substitutions at C/GpC. Most of the SNVsin each of these 
foci can be phased to the sameallele and no evidence of anti-phasing is 
observed. c, Example of achromothripsis event ina melanoma. The black 
points (top) represent copy-number estimates from individual genomic bins, 
with SVs shownas coloured arcs (translocation in black, deletion in purple, 
duplication in brown, tail-to-tail inversion in cyan, head-to-head inversionin 
green) that mostly demarcate copy-number changes. The mate chromosomes 
are displayed above translocations. Bottom, the variant allele fractions of 
somatic mutations distributed along the relevant chromosomal region. 
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Extended Data Fig. 6| Patterns of intense kataegis. a, Distribution of the replication timing bias (c) for rearrangement-(in)dependent APOBEC kataegis, 
tumour types (colour-coded as in Extended Data Fig. 3) of the samples in the based onn=2,583 tumours. Pvalues were derived using a two-sided Mann-— 
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combination of the last two. b, c, Distribution of leading/lagging strand (b) and 
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Extended Data Fig. 7| See next page for caption. 


Extended Data Fig. 7 | Association of chromothripsis with covariates and 
driver events. a, Odds ratios per cancer type of containing chromothripsis in 
whole-genome duplicated versus diploid samples (n = 2,583 patients). 

***q < 0.001; **q < 0.01; *q < 0.05. Two-sided hypothesis testing was performed 
using Fisher-Boschloo tests, corrected for multiple-hypothesis testing. 

b, Same asa for female versus male. c, Proportion of mutations explained by 
single-base substitution signature 1 and age at diagnosis in prostate cancer 
samples (n=210 patients) with or without chromothripsis (¢< 0.05). The early- 
onset prostate cancer project drives the signal and was sequenced at lower 
depth. For the box-and-whisker plots, the box denotes the interquartile range, 
with the median marked asa horizontal line. The whiskers extend as far as the 
range or 1.5x the interquartile range, whichever is less. Two-sided hypothesis 
testing was performed using Mann-Whitney U-tests. d, Counts of co- 
occurrence of chromothripsis with amplification (blue) and homozygous 
deletions (red) in driver regions: observed (thick line) versus randomized 
(shaded areaand thin line). The cumulative number of drivers that were hit is 


plotted asa function of the number of times those drivers were hit. e, For each 
sample in which chromothripsis coincided witha driver event in those genes, 
we show the fold change in gene expression compared tothe median 
expression of the gene innon-chromothripsis samples of the same cancer type, 
coloured by cancer type and shaped by the type of driver event. We show with 
added transparency the fold changes calculated the same way for samples with 
driver mutations hitting the same driver genes, but that had no evidence of 
chromothripsis. Analysis is based onn=1,222 patients with RNA-sequencing 
data. f, Enrichment of co-occurrence of chromothripsis with driver events. The 
xaxis shows the association of chromothripsis with a driver ina given cancer 
type compared with its rate of association with that driver in all other cancer 
types. The yaxis shows the association of chromothripsis witha driverina 
given cancer type compared with its rate of association with all other drivers in 
that type. Exact binomial tests are used and Pvalues are corrected for multiple 
testing according to the Benjamini-Hochberg method. 
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Extended Data Fig. 8| See next page for caption. 
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Extended Data Fig. 8 | Further examples of chromothripsis-induced 
amplification targeting multiple cancer-associated genes simultaneously 
in melanoma. a, Examples of amplifications that occurred early inthe 
development of melanoma. The black points (top) represent copy-number 
estimates from individual genomic bins, with SVs shown as coloured arcs 
(translocation in black, deletion in purple, duplication in brown, tail-to-tail 
inversionin cyanand head-to-head inversion in green) that mostly demarcate 
copy-number changes. Bottom, the variant allele fractions of SNVs distributed 


along the relevant chromosomal region. The paucity of somatic mutations at 
high variant allele fractions in the most-heavily amplified regions indicates that 
these amplifications began very early in tumour evolution, before the lineage 
had had opportunity to acquire many SNVs. b, Example of an amplification that 
occurred late in melanoma development. The large numbers of somatic 
mutations at high variant allele fractions in the most-heavily amplified regions 
indicate that these amplifications began late in tumour evolution, after the 
lineage had already acquired many SNVs. 
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Extended Data Fig. 9| See next page for caption. 


Extended Data Fig. 9 | Timing the amplifications after chromothripsis in 
molecular time for 10 representative cases. a, Copy-number plot of 
chromothriptic regions categorized as ‘liposarc-like’ in five acral melanomas 
with CCND1 amplification. Segments indicate the copy number of the major 
allele. Points represent SNV multiplicities, that is, the estimated number of 
copies carrying each SNV, coloured by base change and shaped by strand. Small 
vertical arrows link SNVs to their corresponding copy-number segment. 
Kataegis foci are shown within black boxes and show typical strand 
specificities (all triangles or all circles), similar multiplicities and base changes 
of signatures 2 and 13 (red and black, respectively). A coloured bar (top right) 


represents the molecular timing of the amplification (red bar; highis early, low 
is late) and is coloured by the fraction of total SNVs assigned to the following 
timing categories: clonal [early], clonal mutations that occurred before 
duplications involving the relevant chromosome (including whole-genome 
duplications); clonal [late], clonal mutations that occurred after such 
duplications; and clonal [NA], mutations that occurred when no duplication 
was observed. b, Same as ain two cutaneous melanomas, one shows early 
amplification, the other late amplification. c, Same asa, b, for three lung 
squamous cell carcinomas and late amplification of SOX2. 
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Extended Data Fig. 10 | See next page for caption. 
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Extended Data Fig. 10 | Association between common germline variants and 
endogenous mutational processes. Genome-wide association of somatic CpG 
mutagenesis in individuals of European ancestry (n=1,201 patients) based on 
mutational signature analysis (a) and NpCpG motif analysis (b). Two-sided 
hypothesis testing was performed using PLINK v.1.9. To mitigate multiple- 
hypothesis testing, the significance threshold was set to genome-wide 
significance (P<5 10°). c,d, Locuszoom plot for somatic APOBEC3B-like 
mutagenesis association results, linkage disequilibrium and recombination 
rates around the genome-wide significant 22q13.1locus in individuals with 
European (c) and East Asian (d) ancestry (n=1,201and 318 patients, 


respectively). Locuszoom plot for somatic APOBEC3B-like mutagenesis 
association results around the 22q13.1 locus in individuals with European (e) 
and East Asian (f) ancestry after conditioning onrs12628403.g,h, Association 
between rs2142833 and expression of APOBEC3 genes in PCAWG tumour 
samples (adjusted for sex, age at diagnosis, histology and population structure 
in linear-regression models with two-sided hypothesis testing not corrected 
for multiple tests). For the box-and-whisker plot, the box denotes the 
interquartile range, with the median marked as a horizontal line. The whiskers 
extend as far as the range or 1.5x the interquartile range, whichever is less. 
Outliers are shownas points. 
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Extended Data Fig. 11| See next page for caption. 


Extended Data Fig. 11| Association between rare germline PTVs in protein- 
coding genes and somatic mutational phenotypes. a—d, f, Data are based on 
two-sided rare-variant association testing across n= 2,583 patients, witha 
stringent Pvalue threshold of P<2.5 x 10° used to mitigate multiple- 
hypothesis testing (significant genes marked with coloured circles). Blue/red 
circles mark genes that decrease/increase somatic mutation rates. The black 
line represents the identity line that would be followed if the observed Pvalues 
followed the null expectation, with the shaded area showing the 95% 
confidence intervals. a, QQ plots for the proportion of somatic SV deletions, 
tandem duplications, inversions and translocation in cancer genomes. b, QQ 
plots for the proportion of somatic SV deletions in cancer genomes stratified 
by four size groups (1-10 kb, 10-100 kb, 100-1,000 kb and >1,000 kb). c, QQ 
plots for the proportion of somatic SV tandem duplications in cancer genomes 


stratified by four size groups (1-10 kb, 10-100 kb, 100-1,000 kb and >1,000 kb). 


d, QQ plot for the presence or absence of somatic SV templated insertion 
(cycles) incancer genomes. e, Number of SV-templated insertion cycles in 
PCAWG tumours with germline BRCAI PTVs. Only histological samples with at 
least one germline BRCAI PTV carrier are shown (n=1,095 patients combined). 
The box denotes the interquartile range, with the median marked asa 
horizontal line. The whiskers extend as far as the range or 1.5x the interquartile 
range, whichever is less. Outliers are shown as points. f, QQ plot for somatic 
CpG mutagenesis in cancer genomes based on NpCpG motif analysis. g, Violin 
plots show estimated densities of the proportion of somatic CpG mutationsin 
PCAWG donors with germline MBD4 and BRCA2PTVs. The box denotes the 


interquartile range, with the median marked asa white point. The whiskers 
extendas far as the range or 1.5x the interquartile range, whichever is less. Two- 
sided hypothesis testing, not corrected for multiple testing, was performed 
using linear regression models. h, Replication of germline MBD4 and BRCA2 
PTV associations with somatic CpG mutagenesis in TCGA whole-exome 
sequencing donors. Violin plots show the estimated density of the proportion 
of somatic CpG mutations in TCGA exomes with germline MBD4 and BRCA2 
PTVs. The box denotes the interquartile range, with the median marked asa 
white point. The whiskers extend as far as the range or 1.5x the interquartile 
range, whichever is less. Two-sided hypothesis testing, not corrected for 
multiple testing, was performed using linear-regression models. i, Correlation 
between MBD4 expression and somatic CpG mutagenesis in primary solid 
PCAWG tumours. Hypothesis testing was two-sided and not corrected for 
multiple testing, using linear-regression models. The box denotes the 
interquartile range, with the median marked asa horizontal line. The whiskers 
extendas far as the range or 1.5x the interquartile range, whichever is less. 

j, Data are mean +s.e.m. across n=20 tumour types. The dashed black line 
shows the fitted line to the data, estimated using linear-regression models. 
Hypothesis testing was two-sided and not corrected for multiple testing, using 
Spearman’s rank correlations. k, MBD4 effect sizes (open circles) with 95% 
confidence intervals (error bars) for individual cancer types were estimated 
using linear-regression analysis after (if available) accounting for sex, age at 
diagnosis (young/old) and ICGC project. Hypothesis testing was two-sided and 
not corrected for multiple testing. 
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Extended Data Fig. 12|See next page for caption. 
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Extended Data Fig. 12 | Germline MEI call set. a, Left, dots show the number of 
transductions promoted by each hot element in individual samples. Arrows 
highlight retrotransposition burst. Right, the contribution of each hot locusis 
represented. The total number of transductions mediated by each source 
elementis shown onthe right. b, Source L1activity rate (that is, measured as the 
average number of transductions mediated by an element) versus the 
percentage of samples with retrotransposition activity in which the germline 
elementis active. For visualization purposes, extreme points observed fora 
source L1 with an activity rate of 49 and for aLlactive in 31% of the samples are 
shownat >20 and =10, respectively. c, Contrasting allele frequencies for 
Strombolian and Plinian source loci (sample sizes shown under each axis label). 
The box denotes the interquartile range, with the median marked as a white 
point. The whiskers extend as far as the range or 1.5x the interquartile range, 
whichever is less. Hypothesis testing was performed using two-sided Mann- 
Whitney U-tests without correction for multiple tests. d, Numbers of active and 
hot source L1 elements per donor. Data are mean+s.d. number of elements per 
donor. e, The novel Plinian source element on 7p12.3 mediates 72 transductions 
among only 6 cancer samples. This generates a transduction that induces the 
deletion of the tumour-suppressor gene CDKNZ2A. f, Violin plots show the 
estimated number of distinct germline MEI alleles per PCAWG donor. The box 
denotes the interquartile range, with the median marked asa white point. The 
whiskers extend as far as the range or 1.5x the interquartile range, whichever is 
less. Donors are grouped according to their genetic ancestry: AFR, African; 


AMR, admixed American; EAS, East Asian; EUR, European; SAS, South Asian. 
Sample sizes are shown under each axis label. g, For each type of MEI (L1, Alu 
and SVA) identified both in PCAWG and in the 1000 Genomes Project (1KGP), 
the correlations between allele frequency estimates per ancestry derived from 
both projects are displayed ina blue (0) to red (1) coloured gradient. n=2,583 
PCAWG patients. Two-sided hypothesis testing was performed using 
Spearman’s rank correlations without correction for multiple tests. h, Example 
correlation between MEL allele frequencies derived from PCAWG and the 1000 
Genomes Project for individuals with European ancestry (n=1,201 patients in 
PCAWG). Two-sided hypothesis testing was performed using Spearman’s rank 
correlations without correction for multiple tests. i, Evaluation of TraFiC-mem 
false-discovery rate ona liver hepatocellular carcinoma sample (DO50807) and 
acell line (NCI-BL2087) sequenced using single-molecule sequencing with 
MinION (Oxford Nanopore). For each allele frequency bin (common, >5%; low 
frequency, 1-5%; rare, <1%), the percentage of events supported by Nlong reads 
is represented (Nranges from O-1to more than 5). MEIs supported by at least 
two Nanopore reads were considered to be true positives (blue palette) and 
were classified as false positives (red) otherwise. The total number of germline 
MEIs per allele frequency bin is shown onthe right.j, Correlation between 
predicted MEI lengths from Illumina and Nanopore data. Two-sided hypothesis 
testing was performed using Spearman’s rank correlations without correction 
for multiple testing. 
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Extended Data Fig. 13 | See next page for caption. 
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Extended Data Fig. 13 | Different mechanisms of telomere lengthening in 
cancer. a, Scatter plot showing the four clusters of tumour-specific telomere 
patterns identified across PCAWG samples, together with the clusters of 
matched normal samples, generated by t-distributed stochastic neighbour 
embedding. Circles represent tumour samples and triangles represent 
matched normal samples. Points are coloured by tissue of origin. Data are 
based onn=2,518 tumour samples and their matched normal samples. 

b, Patterns of comutation of the relevant driver mutations across individual 
patients. Columns in plot represent individual patients, coloured by type of 
abnormality observed.c, Distribution of clonality of driver mutations in genes 
relevant to telomere maintenance across clusters. Clonal [early], clonal 


mutations that occurred before duplications involving the relevant 
chromosome (including whole-genome duplications); clonal [late], clonal 
mutations that occurred after such duplications; and clonal [NA], mutations 
that occurred when no duplication was observed. d, Relationship between the 
estimated number of stem cell divisions per year and rate of telomere 
maintenance abnormalities across tumour types. The analysis uses dataon 
estimated rates of stem cell division per year across n=19 tissue types 
previously collated from the literature®. Tumour types are coloured according 
to the scheme shown in Extended Data Fig. 3. Two-sided hypothesis testing was 
performed using likelihood ratio tests on Poisson regression models with no 
correction for multiple tests. 
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Extended Data Table 1 | Overview of the tumour types included in PCAWG project 


Adeno., adenocarcinoma; Ca., carcinoma; Comb., combined; F, female; HCC, hepatocellular carcinoma; M, male; Med, median; 10-90th, 10-90th centiles; SCC, squamous cell carcinoma. 


CNS CNS-GBM Glioblastoma 41 13 28 60 43-72 
CNS CNS-Medullo Medulloblastoma and variants 146 67 79 9 3-28 
CNS CNS-Oligo Oligodendroglioma 18 9 9 41 21-62 
CNS CNS-PiloAstro Pilocytic astrocytoma 89 47 42 8 2-17 
Skin Skin-Melanoma Malignant melanoma 107 38 69 57 37-78 
ndoderm 
Biliary Biliary-AdenoCA Papillary cholangiocarcinoma 34 15 19 64 53-76 
Bladder Bladder-TCC Transitional cell carcinoma 23 8 15 65 52-80 
Colon/Rectum ColoRect-AdenoCA Adenocarcinoma; Mucinous adeno. 60 30 30 67 46-81 
Oesophagus Eso-AdenoCA Adenocarcinoma 98 14 84 70 56-79 
Liver Liver-HCC Hepatocellular carcinoma; Comb. HCC/cholangio 317 89 228 67 50-78 
Lung Lung-AdenoCA Adenocarcinoma; Adenocarcinoma in situ 38 20 18 66 47-77 
Lung Lung-SCC Squamous cell carcinoma; Basaloid SCC 48 10 38 68 54-77 
Pancreas Panc-AdenoCA Adeno.; Acinar cell Ca.; Mucinous adeno. 239 119 120 67 50-79 
Pancreas Panc-Endocrine Neuroendocrine carcinoma 85 30 55 59 38-75 
Prostate Prost-AdenoCA Adenocarcinoma 210 0 210 59 47-71 
Stomach Stomach-AdenoCA Adenocarcinoma; Mucinous; Papillary; Tubular 75 18 57 65 47-79 
Thyroid Thy-AdenoCA Adenocarcinoma; Columnar cell; Follicular type 48 37 11 51 26-75 
Mesoderm 
Bone/Soft Tissue Bone-Benign Osteoblastoma; Osteofibrous dysplasia 7 4 3 18 12-30 
Bone/Soft Tissue Bone-Benign Chondroblastoma; Chrondromyxoid fibroma 9 16 14-38 
Bone/Soft Tissue Bone-Epith Adamantinoma; Chordoma 10 4 60 37-67 
Bone/Soft Tissue Bone-Osteosarc Osteosarcoma 38 20 18 20 9-58 
Bone/Soft Tissue SoftTissue-Leiomyo Leiomyosarcoma 15 10 5 61 51-78 
Bone/Soft Tissue SoftTissue-Liposarc Liposarcoma 19 5 14. «n/a n/a 
Cervix Cervix-AdenoCA Adenocarcinoma 2 2 39 33-46 
Cervix Cervix-SCC Squamous cell carcinoma 18 18 0 39 25-58 
Head/Neck Head-SCC Squamous cell carcinoma 57 10 47 53 34-71 
Kidney Kidney-ChRCC Adenocarcinoma, chromophobe type 45 19 26 47 34-69 
Kidney Kidney-RCC Clear cell adenocarcinoma; papillary type 144 54 90 60 48-75 
Lymphoid Lymph-BNHL Burkitt; Diffuse large B-cell; Follicular; Marginal 107 51 56 57 10-74 
Lymphoid Lymph-CLL Chronic lymphocytic leukaemia 95 31 64 62 46-78 
Myeloid Myeloid-AML Acute myeloid leukaemia 10 3 vi 50 35-56 
Myeloid Myeloid-MDS Myelodysplastic syndrome 2 1 1 76 74-77 
Myeloid Myeloid-MPN Myeloproliferative neoplasm 26 14 12 56 38-75 
Ovary Ovary-AdenoCA Adenocarcinoma; Serous cystadenocarcinoma 113 113 60 48-74 
Uterus Uterus-AdenoCA _Adeno., endometrioid; Serous cystadeno. 51 51 0 69 57-81 
SE va egy) 
Breast Breast-AdenoCA Infiltrating duct carcinoma; Medullary; Mucinous 198 197 1 56 39-76 
Breast Breast-DCIS Duct micropapillary carcinoma 3 3 0 55 43-60 
Breast Breast-LobularCA — Lobular carcinoma 13 13 0 53 42-69 


Extended Data Table 2 | Ethical considerations of genomic cloud computing 


Ethical Considerations of Genomic Cloud Computing 


The PCAWG project represents the first large-scale use of distributed cloud computing in 
genomics. The project involved the movement of large quantities of personal health 
information across multiple legal jurisdictions and responsible use of this data by several 
hundred international researchers. Donor consents were written to explicitly allow for broad 
research use of the data and for international data sharing. PCAWG was granted permission 
by the leads of each of the tumour data providers to store, analyse and distribute the data on 


academic and/or commercial compute clouds. 


To ensure that the PCAWG personal data were handled in a manner consistent with the donor 
consents, authorised representatives of each of the academic clouds and high-performance 
computing facilities signed a commitment not to access controlled tier data beyond the 
minimum needed to administer it. We negotiated similar contractual terms with commercial 
cloud partners. Prior to accessing the data, each PCAWG researcher was required to obtain 
local Institutional Review Board approval for their proposed analytic projects, and obtained 
controlled tier authorisation from dbGaP (National Center for Biotechnology Information) and 
the ICGC DACO (Centre of Genomics and Policy at McGill University). To handle the data 
securely, we encrypted it while in motion and at rest. We used a central authentication and 
digital token generating system to enforce a strong data access protocol that required 
researchers to provide their TCGA and/or ICGC credentials prior to accessing controlled tier 
data. No data breach or other compromise of donor confidentiality is known to have occurred 


over the course of the PCAWG project, despite its extensive use of cloud computing. 
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Extended Data Table 3 | Scientific output using PCAWG data, in bite-size chunks 


: e Estimated ~10-fold more coding than non-coding driver point mutations. 
Discovery of non- 


: : e@ Variation in point mutation density in non-coding regions influenced more by mutational 4 
coding drivers . 
processes than selection. 
Drivers by pathways e Both coding and non-coding alterations contribute to cancer pathways. 
= fed : : ; 16 
and networks e Some pathways, such as RNA splicing, are primarily driven by non-coding mutations. 


e@ Each tumour type has a distinct pattern of early and late-occurring driver events. 
e Earliest somatic mutations may occur decades prior to diagnosis, providing opportunities 

for early diagnosis. 7 
e Intra-tumour heterogeneity is widespread and tumour subclones contain drivers that are 

under positive selection. 


Timing of cancer 
evolution 


e Replication-based mechanisms of genome rearrangement frequent in many cancers, 


Patterns of structural often causing driver structural variants. 6 
variation e 16 signatures of SV, including break-and-ligate patterns and copy-and-insert patterns, 
varying by size range, replication timing, tumour type and patient. 
eanetanal e 52 regions with recurrent structural breakpoints and 90 recurrently fused pairs of loci 
show evidence of positive selection. 
consequence of : . : a ae es 4 
er @ Oncogenic fusions are shaped by juxtaposition of proto-oncogenes with tissue-specific 
structural variation 
regulatory elements. 
e@ Many flavours of somatic retrotransposition in many cancers: LINE element mobilisation; 
Patterns of transductions, pseudogenes, Alu elements. 10 
retrotransposition e Retrotranspositions can induce genomic instability, including large deletions and 
breakage-fusion-bridge cycles amplifying cancer genes. 
ae e@ Chromothripsis pervasive across cancers, with frequency >50% in several tumour types. 
Chromothripsis ee q Y ° YP 18 


e Replicative processes and templated insertions contribute to rearrangement. 


Signatures of point e >70 distinct mutational signatures, encompassing SNVs, doublet subs and indels. 
mutations e Multiple signatures from unknown processes of DNA damage, repair and replication. 
e Uneven distribution of somatic mutations and structural variants across the genome 

explained by epigenetic state of tissue, cell of origin and topological associated domains. 11,12,15 
@ Can be used to identify a tumour’s type and presumed tissue/cell of origin. 


Mutation distribution 
across genome 


e@ Genomic basis for RNA alterations across ~1200 tumours, including quantitative trait loci, 
RNA effects of somatic allele specific expression and alternative splicing. 
mutation @ Link between mutational signatures and expression; classification of gene fusions; 
identification of genes recurrently altered at RNA level. 


8,9 


Tumour subtypes from @ Genomic distribution of somatic mutations, mutational signatures and driver mutations 


: ee : : , : 12 
genome sequencing accurately distinguish major tumour types of primaries and metastases. 
Mitochondrial DNA e@ Somatic mitochondrial truncating mutations frequent in certain cancer types, associated 14 
mutations with activation of critical signaling pathways. 
: e Activating TERT promoter mutations are the single most frequent non-coding driver. 
Telomere biology and : eee ane: 
@ In ATRX/DAXX-mutant tumours, aberrant telomere variant repeat distribution is 4,13 


sequences 
common. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


4 The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


— For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Data and metadata were collected from International Cancer Genome Consortium (ICGC) consortium members using custom software 
packages designed by the ICGC Data Coordinating Centre. The general-purpose core libraries and utilities underlying this software have 
been released under the GPLv3 open source license as the "Overture" package and are available at https://www.overture.bio. Other data 
collection software used in this effort, such as |CGC-specific portal user interfaces, are available upon request to contact @overture.bio. 


Data analysis The workflows executing core WGS alignment, QC and variant-calling software are packaged as executable Dockstore images and 
available at: https://dockstore.org/search?labels.value.keyword=pcawg&searchMode=files. Individual software components are as 
follows: BWA-MEM vO0.78.8-r455; DELLY v0.6.6; ACEseq v1.0.189; DKFZ somatic SNV workflow v1.0.132-1; Platypus v0.7.4; ascatNgs 
v1.5.2; BRASS v4.012; grass v1.1.6; CaVEMan v1.50; Pindel v1.5.7; ABSOLUTE/JaBbA v1.5; SvABA 2015-05-20; dRanger 2016-03-13; 
BreakPointer 2015-12-22; MuTect v1.1.4; MuSE v1.0rc; SMuFIN 2014-10-26; OxoG 2016-4-28; VAGrENT v2.1.2; ANNOVAR v2014Nov12; 
VariantBAM v2017Dec12; SNV-Merge v2017May26; SV-MERGE v2017Dec12; DKFZ v2016Dec15; Chromothripsis Explorer v1.0. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


WGS somatic and germline variant calls, mutational signatures, subclonal reconstructions, transcript abundance, splice calls and other core data generated by the 
ICGC/TCGA Pan-cancer Analysis of Whole Genomes Consortium are available for download at https://dcc.icgc.org/releases/PCAWG. Additional information on 
accessing the data, including raw read files, can be found at https://docs.icgc.org/pcawg/data/. In accordance with the data access policies of the ICGC and TCGA 
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projects, most molecular, clinical and specimen data are in an open tier which does not require access approval. To access potentially identification information, 
such as germline alleles and underlying sequencing data, researchers will need to apply to the TCGA Data Access Committee (DAC) via dbGaP (https:// 
dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?page=login) for access to the TCGA portion of the dataset, and to the ICGC Data Access Compliance Office (DACO; http:// 
icgc.org/daco) for the ICGC portion. In addition, to access somatic single nucleotide variants derived from TCGA donors, researchers will also need to obtain dbGaP 
authorization 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size We compiled an inventory of matched tumour/normal whole cancer genomes in the ICGC Data Coordinating Centre. Most samples came 
from treatment-naive, primary cancers, but there were a small number of donors with multiple samples of primary, metastatic and/or 
recurrent tumours. Our inclusion criteria were: (i) matched tumour and normal specimen pair; (ii) a minimal set of clinical fields; and (iii) 
characterisation of tumour and normal whole genomes using IIlumina HiSeq paired-end sequencing reads. 
We collected genome data from 2,834 donors, representing all ICGC and TCGA donors that met these criteria at the time of the final data 
freeze in autumn 2014. No statistical methods were used to predetermine sample size. 


Data exclusions After quality assurance, data from 176 donors were excluded as unusable. Reasons for data exclusions included inadequate coverage, 
extreme bias in coverage across the genome, evidence for contamination in samples and excessive sequencing errors (for example, through 8- 
oxoguanine). Exclusion criteria were pre-determined. 


Replication In order to evaluate the performance of each of the mutation-calling pipelines and determine an integration strategy, we performed a large- 
scale deep sequencing validation experiment. We selected a pilot set of 63 representative tumour/normal pairs, on which we ran the three 
core pipelines, together with a set of 10 additional somatic variant-calling pipelines contributed by members of the SNV Calling Working 
Group. Overall, the sensitivity and precision of the consensus somatic variant calls were 95% (CI90%: 88-98%) and 95% (CI90%: 71-99%) 
respectively for SNVs. For somatic indels, sensitivity and precision were 60% (34-72%) and 91% (73-96%) respectively. Regarding SVs, we 
estimate the sensitivity of the merging algorithm to be 90% for true calls generated by any one caller; precision was estimated as 97.5% - that 
is, 97.5% of SVs in the merged SV call-set have an associated copy number change or balanced partner rearrangement. 


Randomization No randomisation was performed - this was a descriptive study, not an experimental study. 


Blinding No blinding was undertaken - this was a descriptive study, not an experimental study. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
| | Palaeontology |] MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Patient-by-patient clinical data are provided in the marker paper for the PCAWG consortium (Extended Data Table 1 of that 
manuscript). Demographically, the cohort included 1,469 males (55%) and 1,189 females (45%), with a mean age of 56 years 
(range, 1-90 years). Using population ancestry-differentiated single nucleotide polymorphisms (SNPs), the ancestry distribution 
was heavily weighted towards donors of European descent (77% of total) followed by East Asians (16%), as expected for large 
contributions from European, North American and Australian projects. We consolidated histopathology descriptions of the 
tumour samples, using the ICD-0-3 tumour site controlled vocabulary. Overall, the PCAWG data set comprises 38 distinct tumour 
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types. While the most common tumour types are included in the dataset, their distribution does not match the relative 
population incidences, largely due to differences among contributing ICGC/TCGA groups in numbers sequenced. 


Recruitment Patients were recruited by the participating centres following local protocols. Samples obtained had to meet criteria on amount 
of tumour DNA available, meaning that the cohort is potentially somewhat biased towards larger tumours. Otherwise, we 
anticipate no major recruitment biases. 


Ethics oversight The Ethics oversight for the PCAWG protocol was undertaken by the TCGA Program Office and the Ethics and Governance 
Committee of the ICGC. Each individual ICGC and TCGA project that contributed data to PCAWG had their own local 
arrangements for ethics oversight and regulatory alignment. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Open access 


Somatic mutations in cancer genomes are caused by multiple mutational processes, 
each of which generates a characteristic mutational signature’. Here, as part of the 
Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium’ of the International 
Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA), we 
characterized mutational signatures using 84,729,690 somatic mutations from 
4,645 whole-genome and 19,184 exome sequences that encompass most types of 
cancer. We identified 49 single-base-substitution, 11 doublet-base-substitution, 

4 clustered-base-substitution and 17 small insertion-and-deletion signatures. The 
substantial size of our dataset, compared with previous analyses* >, enabled the 
discovery of new signatures, the separation of overlapping signatures and the 
decomposition of signatures into components that may represent associated—but 
distinct-DNA damage, repair and/or replication mechanisms. By estimating the 
contribution of each signature to the mutational catalogues of individual cancer 
genomes, we revealed associations of signatures to exogenous or endogenous 
exposures, as well as to defective DNA-maintenance processes. However, many 
signatures are of unknown cause. This analysis provides a systematic perspective on 
the repertoire of mutational processes that contribute to the development of human 
cancer. 


Somatic mutations in cancer genomes are caused by mutational pro- 
cesses of both exogenous and endogenous origin that operate dur- 
ing the cell lineage between the fertilized egg and the cancer cell’®. 
Each mutational process may involve components of DNA damage or 
modification, DNA repair and DNA replication (which may be normal 
or abnormal), and generates a characteristic mutational signature that 
potentially includes base substitutions, small insertions and deletions 
(indels), genome rearrangements and chromosome copy-number 
changes’. The mutations in an individual cancer genome may have 
been generated by multiple mutational processes, and thus incor- 
porate multiple superimposed mutational signatures. Therefore, to 
systematically characterize the mutational processes that contribute to 


cancer, mathematical methods have previously been used to decipher 
mutational signatures from somatic mutation catalogues, estimate 
the number of mutations that are attributable to each signature in 
individual samples and annotate each mutation class in each tumour 
with the probability that it arose from each signature®”” ””, 

Previous studies of multiple types of cancer have identified more 
than 30 single-base substitution (SBS) signatures, some of known— 
but many of unknown~—aetiologies, some ubiquitous and others rare, 
some part of normal cell biology and others associated with abnormal 
exposures or neoplastic progression? >” ». Genome rearrangement 
signatures have also previously been described”>5 °°, However, the 
analysis of other classes of mutation has been relatively limited?" *. 
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Mutational signature analysis has predominantly used cancer exome 
sequences. However, the many-fold-greater numbers of somatic muta- 
tions in whole genomes provide substantially increased power for 
signature decomposition, enabling the better separation of partially cor- 
related signatures and the extraction of signatures that contribute rela- 
tively small numbers of mutations. Furthermore, technical artefacts and 
differences in sequencing technologies and mutation-calling algorithms 
can themselves generate mutational signatures. Therefore, the uni- 
formly processed and highly curated sets of all classes of somatic muta- 
tions from the 2,780 cancer genomes of the PCAWG project”, combined 
with most other suitable cancer genomes (accession code syn11801889, 
available at https://www.synapse.org/#!Synapse:syn11801889), present 
anotable opportunity to establish the repertoire of mutational signa- 
tures and determine their activities across different types of cancer. The 
timing of these signatures during the evolution of individual cancers and 
the repertoire of signatures of structural variation have been explored 
in other PCAWG analyses****. 


Mutational signature analysis 


The 23,829 samples—which include most types of cancer, and comprise 
the 2,780 PCAWG whole genomes’, 1,865 additional whole genomes and 
19,184 exomes—yielded 79,793,266 somatic SBSs, 814,191 doublet-base 
substitutions (DBSs) and 4,122,233 small indels that were analysed for 
mutational signatures, about 10-fold-more mutations than any previ- 
ous study of which we are aware (syn11801889)°. 

We developed classifications for each type of mutation. For SBSs, the 
primary classification comprised 96 classes (available at https://cancer. 
sanger.ac.uk/cosmic/signatures/SBS) constituted by the 6 base substi- 
tutions C>A, C>G, C>T, T>A, T>C and T>G (in which the mutated base is 
represented by the pyrimidine of the base pair), plus the flanking 5’ and 
3’ bases. In some analyses, two flanking bases 5’ and 3’ to the mutated 
base were considered (producing 1,536 classes) or mutations within 
transcribed genome regions were selected and classified according to 
whether the mutated pyrimidine fell on the transcribed or untranscribed 
strand (producing 192 classes). We also derived a classification for DBSs 
(78 classes; available at https://cancer.sanger.ac.uk/cosmic/signatures/ 
DBS). Indels were classified as deletions or insertions and—when of asingle 
base—as C or T, and according to the length of the mononucleotide repeat 
tract in which they occurred. Longer indels were classified as occurring 
at repeats or with overlapping microhomology at deletion boundaries, 
andaccording to the size of indel, repeat and microhomology (83 classes; 
available at https://cancer.sanger.ac.uk/cosmic/signatures/ID). 

The PCAWG whole-genome sequences, the additional whole-genome 
sequences and the exome sequences were each analysed separately 
(syn11801889). Signatures were extracted from each type of cancer 
individually, from all cancer types together, as separate SBS, DBS and 
indel signatures, and as composite signatures of all three types of muta- 
tion (Supplementary Note 2). 

We used two methods based on nonnegative matrix factorization 
(NMF): SigProfiler, an elaborated version of the framework used for 
the previous ‘Catalogue Of Somatic Mutations In Cancer’ (COSMIC) 
compendium of mutational signatures (COSMIC v.2, available at https:// 
cancer.sanger.ac.uk/cosmic/signatures v2)", and SignatureAnalyzer, 
which is based ona Bayesian variant of NMF*”*>. NMF determines the 
signature profiles and contributions of each signature to each cancer 
genome as part of its factorization of the input matrix of mutation 
spectra. However, with many signatures and/or heterogeneous muta- 
tion burdens across samples, the mutations observed ina particular 
sample can be reconstructed in multiple ways—often with small and/or 
biologically implausible contributions from many signatures. There- 
fore, each method has developed a separate procedure for estimating 
the contributions of signatures to each sample (Methods). 

We tested SignatureAnalyzer and SigProfiler on 11 sets of synthetic 
data (including 64,400 synthetic samples), generated from known 


signature profiles (Methods, Supplementary Note 2). Both methods 
performed well in re-extracting known signatures from realistically 
complex data. Extracted signatures that were discordant from the 
known input usually arose from difficulties in selecting the correct 
number of signatures. The results confirm that use of NMF-based 
approaches for extracting mutational signatures is not a purely algo- 
rithmic process, but also requires consideration of evidence from 
experimentally determined mutational signatures and the DNA dam- 
age and repair literature, prior evidence of biological plausibility and 
human-guided sensitivity analysis confirming that extractions from 
different groupings of tumours yield consistent results. We used these 
types of evidence and approaches in determining the signature pro- 
files reported here. The findings are consistent with results regard- 
ing NMF, and the related areas of probabilistic topic modelling and 
latent Dirichlet allocation, in multiple problem domains**”. It is widely 
understood that the choice of the number of latent variables (for our 
purposes, the number of mutational signatures) is rarely amenable to 
complete automation. 

The results from our SigProfiler and SignatureAnalyzer analyses 
of cancer data exhibited many similarities, and we assigned the same 
identifiers to similar signatures extracted using the two methods 
(syn12016215). However, there were also noteworthy differences. 
The numbers of SBS signatures found in PCAWG tumours with alow 
mutation burden (94.4% of cases that contain 47% of mutations) were 
similar: 31 using SigProfiler and 35 using SignatureAnalyzer. However, 
the numbers of additional SBS signatures extracted from hypermutated 
PCAWG samples (5.6% of cases, containing 53% of mutations) were dif- 
ferent: 13 using SigProfiler and 25 using SignatureAnalyzer. There were 
also differences in SBS signature profiles, including among signatures 
found in cases with alow mutation burden. The latter primarily involved 
relatively featureless (‘flat’) signatures, which are mathematically chal- 
lenging to deconvolute. Finally, there were differences in signature 
attributions to individual samples. SignatureAnalyzer used more sig- 
natures to reconstruct the mutational profiles (Extended Data Fig. 1) 
(syn12169204 and syn12177011) and attributions to flat signatures were 
different (Extended Data Fig. 2a, b) (syn12169204). The DBS and indel 
signatures were generally similar between the two methods (Extended 
Data Fig. 2c, d). 

The final reference mutational signatures were determined from 
the PCAWG set, supplemented by additional signatures from the other 
datasets (COSMIC, available at https://cancer.sanger.ac.uk/cosmic/ 
signatures). Each signature was allocated an identifier consistent with, 
and extending, the COSMIC v.2 annotation. Some previous signatures 
split into multiple constituent signatures: these were numbered as in 
the previous annotation, but with additional letter suffixes (for exam- 
ple, SBS17 was split into SBS17a and SBS17b). DNA sequencing and 
analysis artefacts also generate mutational signatures. We indicate 
which signatures are possible artefacts but do not present them below 
(full information is available at https://cancer.sanger.ac.uk/cosmic/ 
signatures). The results of both SignatureAnalyzer and SigProfiler were 
used throughout the study. However, for brevity and for continuity 
with the signature set previously displayed in COSMIC v.2—which has 
been widely used as a reference—SigProfiler results are outlined here, 
and SignatureAnalyzer results are provided in Extended Data Figs. 3, 
4 and at syn11738307. 


Single-base substitution signatures 


There were substantial differences in the numbers of SBSs between 
samples (ranging from hundreds to millions) and between cancer 
types*s (Fig. 1). In total, 67 SBS mutational signatures were extracted, 
of which 49 were considered likely to be of biological origin (Fig. 2, 
Methods; available at https://cancer.sanger.ac.uk/cosmic/signatures/ 
SBS/). Except for signature SBS25, all signatures reported in COSMIC v.2 
(ref. °) were confirmed; the median cosine similarity between the newly 
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derived signatures and those on COSMIC v.2 was 0.95, excluding the 
‘split’ signatures (discussed below). SBS25 was previously found in cell 
lines derived from Hodgkin lymphomas treated with chemotherapy, 
and no primary cancers of this type were available. The newly derived 
signatures showed much improved separation from each other and 
more-distinct signature profiles, as compared with COSMIC v.2 sig- 
natures (see ‘Better separation compared to COSMIC v.2 signatures’ 
in Supplementary Note 2 for more information). 

Thirteen of the SBS signatures we extracted (excluding those due 
to signature splitting) represent newly identified and probably real 
signatures, not present in COSMIC v.2. Some were rare (SBS31, SBS32, 
SBS35, SBS36, SBS42 and SBS44). Others were more common, but con- 
tributed relatively few mutations and/or were similar to previously 
discovered signatures (SBS38, SBS39 and SBS40). Notably, SBS40 is a 
flat signature similar to SBSS. It contributes to multiple types of can- 
cer, but its similarity to SBS5 renders the extent of this contribution 
uncertain. For some of the newly identified signatures, there were 
plausible underlying aetiologies (Fig. 3, Extended Data Figs. 4, 5): for 
SBS31 and SBS35, platinum compound chemotherapy”; for SBS32, 
azathioprine therapy; for SBS36, inactivating germline or somatic 
mutations in MUTYH (which encodes a component of the base excision 
repair machinery)*°“); for SBS38, additional effects of exposure to ultra- 
violet (UV) light; for SBS42, occupational exposure to haloalkanes”; 
and for SBS44, defective DNA mismatch repair“. 

Three previously characterized base substitution signatures (SBS7, 
SBS10 and SBS17) split into multiple constituent signatures (Fig. 2). 
Signature splitting probably reflects the existence of multiple distinct 
mutational processes initiated by the same exposure that have closely— 
but not perfectly—correlated activities. We previously regarded SBS7 
as asingle signature composed predominantly of C>T at CCN and TCN 
trinucleotides (the mutated base is underlined) together with many 
fewer T>N mutations. It was found in malignant melanomas and squa- 
mous skin carcinomas, and is probably due to the UV-light-induced 
formation of pyrimidine dimers, followed by translesion DNA synthesis 
by error-prone polymerases predominantly inserting A opposite to 
damaged cytosines. SBS7 has now been decomposed into four con- 
stituent signatures. SBS7a and SBS7b (consisting mainly of C>T at TCN 
and C>T at CCN, respectively) may reflect different pyrimidine-dimer 
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photoproducts. SBS7c and SBS7d (consisting predominantly of T>A 
at NTT and T>C at NTT, respectively**) may be due to low frequencies 
of the misincorporation of T and G opposite to thymines in pyrimidine 
dimers. The splitting of SBS10 and SBS17 is described at https://cancer. 
sanger.ac.uk/cosmic/signatures/SBS/. 

Several base substitution signatures showed transcriptional strand 
bias, which may be attributable to transcription-coupled nucleotide 
excision repair acting on DNA damage and/or to an excess of DNA dam- 
age on untranscribed strands of genes**. Both mechanisms result in 
more mutations of damaged bases on untranscribed than on tran- 
scribed strands of genes. Assuming that either mechanism is respon- 
sible for the observed transcriptional strand biases, DNA damage to 
cytosine (SBS7a and SBS7b), guanine (SBS4, SBS8, SBS19, SBS23, SBS24, 
SBS31, SBS32, SBS35 and SBS42), thymine (SBS7c, SBS7d, SBS21, SBS26 
and SBS33) and adenine (SBS5, SBS12, SBS16, SBS22 and SBS25) may 
underlie these mutational signatures (plots of strand bias are available 
at https://cancer.sanger.ac.uk/cosmic/signatures/SBS/). The likely 
DNA-damaging agents are known for SBS4 (tobacco mutagens), SBS7a, 
SBS7b, SBS7c and SBS7d (UV light), SBS22 (aristolochic acid), SBS24 
(aflatoxin), SBS25 (chemotherapy), SBS31 and SBS35 (platinum com- 
pounds), SBS32 (azathioprine) and SBS42 (haloalkanes). 

Using the SBS classification of 1,536 mutation types, which uses the 
sequence context two bases 5’ and two bases 3’ to each mutated base, 
yielded signatures that are largely consistent with those based on sub- 
stitutions in trinucleotide contexts. Notably, however, two forms of 
both SBS2 and SBS13 were extracted, one with mainly a pyrimidine and 
the other with mainly a purine at the —2 base (the second base 5’ to the 
mutated cytosine). These may represent the activities of the cytidine 
deaminases APOBEC3A and APOBEC3B, respectively*. If so, APOBEC3A 
accounts for many more mutations than APOBEC3B in cancers with 
high APOBEC activity. Other signatures showed nonrandom sequence 
contexts at +2 and -2 positions (for example, SBS17a, SBS17b and SBS9), 
but sequence context effects were generally much stronger for bases 
immediately 5’ and 3’ to mutated bases. 

SBS signatures showed substantial variation in the numbers of can- 
cer types and cancer samples in which they were found, and in the 
mutations attributed per cancer sample (Fig. 3). Almost all individual 
cancer samples exhibited multiple signatures, with a mode of three in 
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Fig. 2| Profiles of SBS, DBS and small indel mutational signatures. The 
classifications of each mutation type (SBS, 96 classes; DBS, 78 classes; and 
indels, 83 classes) are described in the main text. Magnified versions of 
signatures SBS4, DBS2 and ID3 (all of which are associated with tobacco 


the PCAWG set (syn12169204). The assigned signatures reconstruct 
well the mutational spectra of the cancer samples (in PCAWG samples, 
the median cosine similarity was 0.97; 96.3% of samples with cosine 
similarity >0.90): Fig. 4 shows illustrative examples. 


smoking) are shown to illustrate the positions of each mutation subtype on 
each plot. The plotted data are available in digital form (along with the x axis 
labels) at syn12025148. 


Some mutational processes generate base substitutions that cluster 
in small genomic regions. The limited numbers of such mutations may 
resultina failure to detect their signatures using standard methods. We 
therefore identified clustered mutations in each genome and analysed 


Nature | Vol578 | 6 February 2020 | 97 


Article 


Fig. 3 | The number of mutations contributed 
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them separately (Methods). Four main clustered mutational signatures 
were identified (Fig. 2), as previously reported*””2. Two, which are found 
in multiple types of cancer, were similar to SBS2 and SBS13 (which have 
been attributed to APOBEC enzyme activity) and represent foci of katae- 
gis***“°, Two further clustered signatures, one characterized by C>T and 
C>G mutations at (A or G)C(C or T) trinucleotides” and the other T>A 
and T>C mutations at (A or T)T(A or T), were found in lymphoid neo- 
plasms; they probably represent the direct and indirect consequences of 
activation-induced cytidine deaminase mutagenesis and translesion DNA 
synthesis by error-prone polymerases (SBS84 and SBS85, respectively)”. 


Doublet-base substitution signatures 


Tandem doublet, triplet, quadruplet, quintuplet and sextuplet base 
substitutions (syn11801938 and syn11726620) were observed at about 
1% the prevalence of SBSs. In most cancer genomes, the number of DBSs 


98 | Nature | Vol578 | 6 February 2020 


e Ultraviolet light exposure 


Possibly APOBEC activity 


Swi 
@O@ Replication slippage; defective DNA mismatch repair 
@ Replication slippage; defective DNA mismatch repair 


e DSB repair by non-homologous end joining 
Defective DNA mismatch repair 


@ © DSB repair by non-homologous end-joining 


Median mutations per 
Mb due to signature 
(among tumours 
with the signature) 


was considerably higher than would be expected from the random 
adjacency of SBSs (syn12177057), indicating the existence of com- 
monly occurring, single mutagenic events that cause substitutions 
at neighbouring bases. There was substantial variation in the number 
of DBSs, ranging from 0 to 20,818 in asample. The numbers of DBSs 
were generally proportional to the numbers of SBSs (Fig. 1), although 
colorectal adenocarcinomas had fewer than expected, and lung cancers 
and melanomas had more (Extended Data Table 1). We extracted eleven 
DBS signatures (Fig. 2, of which three have previously been reported’, 

Signature DBS1 was characterized by CC>TT mutations (Fig. 2), con- 
tributed hundreds to tens of thousands of mutations in malignant mela- 
nomas with SBS7a and SBS7b (Fig. 3), exhibited transcriptional strand 
bias consistent with damage to cytosines (syn12177063) and is aknown 
consequence of DNA damage induced by UV light”. Excluding cancers 
associated with exposure to UV light also yielded a signature (DBS11) 
that was characterized predominantly by CC>TT mutations, but only 
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Fig. 4 | Illustrative examples of mutational spectra of individual cancer samples. The contributory SBS, DBS and small indel mutational signatures in two 


tumours are shown. 


contributing tens of mutations in many samples from multiple types 
of cancer (Figs. 2,3). DBS11 was associated with SBS2, which is due to 
APOBEC activity: APOBEC activity may, therefore, also generate DBSI11. 

DBS2 was composed predominantly of CC>AA mutations, with 
smaller numbers of CC>AG and CC>AT mutations, and contributed 
hundreds to thousands of mutations in lung adenocarcinoma, lung 
squamous and head and neck squamous carcinomas, which are often 
caused by tobacco smoking® (Figs. 2,3). DBS2 showed transcriptional 
strand bias indicative of guanine damage (syn12177064) and was asso- 
ciated with SBS4, which is caused by exposure to tobacco smoke. It is 
likely, therefore, that DBS2 can be a consequence of DNA damage by 
tobacco-smoke mutagens. 

A signature similar to DBS2 contributed hundreds of mutations to 
liver cancers and tens of mutations to other types of cancer without 
evidence of exposure to tobacco smoke. A pattern resembling DBS2 
also dominates DBSs in healthy mouse cells®°. The nature of the muta- 
tional processes that underlie these signatures in human cancers that 
are unrelated to smoking, and in healthy mice, is unknown. However, in 
experimental systems, acetaldehyde exposure has been shown to gen- 
erate a mutational signature characterized primarily by CC>AA muta- 
tions, and lower burdens of CC>AG and CC>AT mutations, together 
with C>A SBSs**. Acetaldehyde is an oxidation product of alcohol anda 
constituent of cigarette smoke. The role of acetaldehyde, and perhaps 
other aldehydes, in generating DBS2 merits further investigation™. 

DBS3, DBS7, DBS8 and DBS10 showed hundreds to thousands of muta- 
tions inrare colorectal, stomach and oesophageal cancers, some of which 
showed evidence of defective DNA mismatch repair (DBS7 and DBS10) or 
polymerase epsilon exonuclease domain mutations (DBS3) that generate 
hypermutator phenotypes (Figs. 2,3). DBS5 was found in cancers exposed 
to platinum chemotherapy, and is associated with SBS31 and SBS35. 


Small insertion-and-deletion signatures 


Indels were usually present at about 10% of the frequency of base 
substitutions (Fig. 1). There was substantial variation between cancer 


genomes inthe number of indels, even when cancers with evidence of 
defective DNA mismatch repair were excluded. Overall, the numbers of 
deletions and insertions were similar, but there was variation between 
cancer types: some cancers showed more deletions and others more 
insertions of various subtypes (Fig. 1). We extracted 17 indel mutational 
signatures (Fig. 2). 

Indel signature 1 (ID1) was composed predominantly of insertions of 
thymine and ID2 was composed predominantly of deletions of thymine, 
both at long (=5) thymine mononucleotide repeats (Fig. 2). Tens to 
hundreds of mutations of both signatures were found in most samples 
of most types of cancer, but were particularly common in colorectal, 
stomach, endometrial and oesophageal cancers and in diffuse large B 
cell lymphoma (Fig. 3). Together, ID1 and ID2 accounted for 97% and 
45% of indels inhypermutated and non-hypermutated cancer genomes, 
respectively (Extended Data Table 2). They are probably due to slip- 
page of either the nascent (ID1) or template strand (ID2) during DNA 
replication of long mononucleotide tracts. 

ID3 was characterized predominantly by deletions of cytosine at 
short (<5-bp long) mononucleotide cytosine repeats and exhibited 
hundreds of mutations in cancers of the lung, head and neck that are 
associated with tobacco smoking (Figs. 2,3). There was transcriptional 
strand bias of mutations, with more guanine deletions than cytosine 
deletions on the untranscribed strands of genes, whichis compatible 
with transcription-coupled nucleotide excision repair of damaged 
guanine (syn12177065 and syn12177066). The numbers of ID3 mutations 
positively correlated with the numbers of SBS4 and DBS2 mutations, 
which we have shown are associated with tobacco smoking (Extended 
Data Figs. 6, 7). Thus, DNA damage by components of tobacco smoke 
probably underlie ID3. 

ID13 was characterized predominantly by deletions of thymine at 
thymine-thymine dinucleotides and exhibited large numbers of muta- 
tions in malignant melanomas of the skin (Figs. 2, 3). The numbers 
of ID13 mutations correlated with the numbers of SBS7a, SBS7b and 
DBS1 mutations, which we have attributed to DNA damage induced 
by UV light (Extended Data Figs. 6, 7). However, deletions of cytosine 
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at cytosine-cytosine dinucleotides did not feature strongly in ID13, 
which may reflect the predominance of thymine compared to cytosine 
dimers induced by UV light. 

ID6 and ID8 were both characterized predominantly by >5-bp dele- 
tions (Fig. 2). ID6 exhibited overlapping microhomology at deletion 
boundaries with a mode of 2 bp (and often longer stretches) and cor- 
related with SBS3, which we have attributed to defective homologous- 
recombination-based repair (Extended Data Figs. 6, 7). By contrast, ID8 
deletions showed shorter or no microhomology at deletion boundaries 
and did not strongly correlate with SBS3. Both deletion patterns may be 
characteristic of DNA double-strand-break repair by non-homologous- 
recombination-based end-joining mechanisms and—if so—this suggests 
that at least two distinct forms are operative in human cancer®. 

Asmall fraction of cancers exhibited very large numbers of ID1 and 
ID2 mutations (>10,000) (Fig. 3) (shown at https://cancer.sanger.ac.uk/ 
cosmic/signatures/ID). These were usually accompanied by SBS6, 
SBS14, SBS15, SBS20, SBS21, SBS26 and/or SBS44, which are associated 
with deficiency in DNA mismatch repair—sometimes combined with 
POLE or POLD1 proofreading deficiency (SBS14 and SBS20)*. Occa- 
sional cases with these signatures additionally showed large numbers 
of indels attributed to ID7 (syn11738668), and rare samples showed 
large numbers of ID4, ID11, ID14, ID15, ID16 or ID17 mutations but did 
not show large numbers of ID1 and ID2 mutations or the SBS signatures 
associated with deficiency in DNA mismatch repair. 


Correlations with age 


A positive correlation between age of cancer diagnosis and the number 
of mutations attributable to a signature suggests that the underlying 
mutational process has been operative (at a more or less constant rate) 
throughout the cell lineage from fertilized egg to cancer cell, and thusin 
the normal cells from which that type of cancer develops*™. Confirming 
previous reports®*, the numbers of SBS1 and SBS5 mutations correlate 
with age, and exhibit different rates in different types of tissue (g val- 
ues provided in syn12030687, syn20317940 and syn12217988). SBS40 
also correlated with age in multiple types of cancer, although—given 
its similarity to SBS5—misattribution cannot be excluded. DBS2 and 
DBS4 correlated with age; consistent with activity innormal cells and, 
when combined their profiles closely resemble the spectrum of DBS 
mutations found in normal mouse cells*. ID1, 1D2, ID5 and ID8 showed 
correlations with age in multiple tissues. ID1 and ID2 indels are probably 
dueto slippage at poly T repeats during DNA replication and correlated 
with the numbers of SBS1 substitutions, which have previously been 
proposed to reflect the number of mitoses a cell has experienced®. 
Thus, SBS1, ID1 and ID2 may all be generated during DNA replication 
at mitosis. The number of ID5 mutations correlated with the number 
of SBS40 mutations, and the mutational processes that underlie these 
two age-correlated signatures may therefore contain commoncompo- 
nents. ID8, which is predominantly composed of >5-bp deletions with 
no or 1bp of microhomology at their boundaries, is probably due to 
DNA double-strand breaks repaired by anon-homologous-end-joining 
mechanism. The results indicate that multiple mutational processes 
operate in normal cells. 


Discussion 

There are important constraints, limitations and assumptions in the 
analytic frameworks used here to characterize mutational signatures. 
Signatures extracted from sample sets in which multiple processes 
are operative remain mathematical approximations, with profiles 
that are potentially influenced by the mathematical approach used 
and other factors. For conceptual and practical simplicity, we assume 
that a single signature is associated with each mutational process and 
provide an average reference signature to represent it. However, we 
do not discount the possibility that further nuances and variations 
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of signature profiles exist. We have estimated the contributions from 
each signature to the mutation burden in each sample. However, with 
increasing numbers of signatures and differences of multiple orders 
of magnitude in mutation burdens between some signatures, prior 
knowledge has helped to avoid biologically implausible results. Thus, 
the further development of methods for deciphering and attributing 
mutational signatures is warranted, ideally supported by signatures 
derived from experimental systems in which the causes are known. 
Nevertheless, signatures with many similarities and some differences 
can be found by different mathematical approaches, and these can 
be confirmed in several ways, including experimentally elucidated 
signatures>?)??*?435+ © and tumours dominated by a single signature 
(syn12016215). 

This analysis includes most publicly available exome and whole- 
genome cancer sequences. Some rare or geographically restricted 
signatures may not have been captured, signatures conferring limited 
mutation burdens may have been missed and signatures of therapeutic 
mutagenic exposures have not been exhaustively explored. Neverthe- 
less, itis likely that a substantial proportion of the naturally occurring 
mutational signatures found in human cancer have now been described. 
This comprehensive repertoire provides a foundation for research into 
the aetiologies of geographical and temporal differences in cancer 
incidence, the mutational processes that operate in healthy tissues 
and non-neoplastic disease states, clinical and public health applica- 
tions of signatures and mechanistic understanding of the mutational 
processes that underlie carcinogenesis. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 

These online methods contain an abridged description of the meth- 
odology used in the current manuscript; extensive details about the 
methodology we used are provided in Supplementary Note 2. Impor- 
tantly, two independently developed computational frameworks (Sig- 
Profiler and SignatureAnalyzer) based on NMF were applied separately 
to the examined sets of mutational catalogues. SigProfiler and Signa- 
tureAnalyzer take different approaches for deciphering mutational 
signatures and for assigning each signature to each sample. By using 
two methods, we aimed to provide a perspective on the effect that dif- 
ferent methodologies can have onthe numbers of signatures generated, 
signature profiles and attributions. In addition to applying SigProfiler 
and SignatureAnalyzer to cancer data, the tools were also applied to 
realistic synthetic data with known solutions. 


Analysis of mutational signatures with SigProfiler 

SigProfiler incorporates two distinct steps for identification of muta- 
tional signatures, based on the previously described methodology®” 
(Extended Data Fig. 8). The first step (SigProfilerExtraction) encom- 
passes a hierarchical de novo extraction of mutational signatures based 
onsomatic mutations and their immediate sequence context, andthe 
second step (SigProfilerAttribution) focuses on accurately estimating 
the number of somatic mutations associated with each extracted muta- 
tional signature in each sample. SigProfilerExtraction is an extension 
of aprevious framework for the analysis of mutational signatures”. In 
brief, for a given set of mutational catalogues, the algorithm deciphers 
a minimal set of mutational signatures that optimally explains the 
proportion of each mutation type and estimates the contribution of 
each signature to each sample. More specifically, for each NMF itera- 
tion, SigProfilerExtraction minimizes a generalized Kullback-Leibler 
divergence constrained for nonnegativity (Supplementary Note 2). The 
algorithm uses multiple NMF iterations (in most cases 1,024) to identify 
the matrix of mutational signatures and the matrix of the activities of 
these signatures, as previously described”. The unknown number of 
signatures is determined by human assessment of the stability and 
accuracy of solutions for a range of values, as previously described”. 
The framework is applied hierarchically to increase its ability to find 
mutational signatures that generate few mutations or are present in 
few samples. 

After signatures are discovered by SigProfilerExtraction, SigPro- 
filerAttribution estimates their contributions to individual samples. 
For each examined sample, the estimation algorithm involves finding 
the minimum of the Frobenius norm of aconstrained function using a 
nonlinear convex optimization programming solver using the interior- 
point algorithm”. See Supplementary Note 2 and Extended Data Fig. 
8b for further details. 


Analysis of mutational signatures with SignatureAnalyzer 

SignatureAnalyzer uses a Bayesian variant of NMF that infers the num- 
ber of signatures through the automatic relevance determination tech- 
nique and delivers highly interpretable and sparse representations for 
both signature profiles and attributions that strike a balance between 
data fitting and model complexity. Further details of the actual imple- 
mentation of the computational approach have previously been pub- 
lished®””**, SignatureAnalyzer was applied by using a two-step signature 
extraction strategy using 1,536 pentanucleotide contexts for SBSs, 83 
indel features and 78 DBS features. In addition to the separate extrac- 
tion of SBS, indel and DBS signatures, we performed a ‘COMPOSITE’ 
signature extraction based on all 1,697 features (1,536 SBS + 78 DBS + 
83 indel). For SBSs, the 1,536 SBS COMPOSITE signatures are preferred; 
for DBSs and indels, the separately extracted signatures are preferred. 


Instep 1 ofthe two-step extraction process, global signature extrac- 
tion was performed for the samples with a low mutation burden 
(n=2,624). These excluded hypermutated tumours: those with puta- 
tive polymerase epsilon (POLE) defects or mismatch repair defects 
(microsatellite instable tumours), skin tumours (which had intense 
UV-light mutagenesis) and one tumour with temozolomide (TMZ) 
exposure. Because the underlying algorithm of SignatureAnalyzer 
performs a stochastic search, different runs can produce different 
results. In step 1, we ran SignatureAnalyzer 10 times and selected the 
solution with the highest posterior probability. In step 2, additional 
signatures unique to hypermutated samples were extracted (again 
selecting the highest posterior probability over ten runs) while allow- 
ing all signatures found in the samples with low mutation burden, to 
explain some of the spectra of hypermutated samples. This approach 
was designed to minimize a well-known ‘signature bleeding’ effect ora 
bias of hyper- or ultramutated samples on the signature extraction. In 
addition, this approach provided information about which signatures 
are unique to the hypermutated samples, which was later used when 
attributing signatures to samples. 

Asimilar strategy was used for signature attribution: we performeda 
separate attribution process for low- and hypermutated samples in all 
COMPOSITE, SBS, DBS and indel signatures. For downstream analyses, 
we preferred to use the COMPOSITE attributions for SBSs and the sepa- 
rately calculated attributions for DBSs and indels. Signature attribu- 
tioninsamples with alow mutation burden was performed separately 
in each tumour type (for example, Biliary-AdenoCA, Bladder-TCC, 
Bone-Osteosarc, and so on). Attribution was also performed separately 
inthe combined microsatellite instable tumours (n=39), POLE (n=9), 
skin melanoma (n=107) and TMZ-exposed samples (syn11738314). In 
both groups, signature availability (which signatures were active, or 
not) was primarily inferred through the automatic relevance determi- 
nation process applied to the activity matrix H only, while fixing the 
signature matrix W. The attribution in samples with a low mutation 
burden was performed using only signatures found in the step 1 of the 
signature extraction. Two additional rules were applied in SBS signature 
attribution to enforce biological plausibility and minimize a signature 
bleeding: (i) allow SBS4 (smoking signature) only inlung, head and neck 
cases; and (ii) allow SBS11 (TMZ signature) ina single GBM sample. This 
was enforced by introducing a binary, signature-by-sample signature 
indicator matrix Z (1, allowed; 0, not allowed), which was multiplied by 
the H matrix in every multiplication update of H. No additional rules 
were applied to indel or DBS signature attributions, except that sig- 
natures found in hypermutated samples were not allowed in samples 
with alow mutation burden. 


Application of SigProfiler and SignatureAnalyzer to synthetic 
data 

Our goal was to evaluate SignatureAnalyzer and SigProfiler on real- 
istic synthetic data to identify any potential limitations of these two 
methods. SignatureAnalyzer and SigProfiler were tested on 11 sets of 
synthetic data, encompassing a total of 64,400 synthetic samples, in 
which known signature profiles were used to generate catalogues of 
synthetic mutational spectra. We operationally defined ‘realistic’ data 
as those based on the characteristics of either SignatureAnalyzer’s or 
SigProfiler’s analysis of the PCAWG genome data. SignatureAnalyzer’s 
reference signature profiles were based on COMPOSITE signatures, 
consisting of 1,536 types of strand-agnostic SBSs in pentanucleotide 
context, 78 types of DBSs and 83 types of small indels, for a total of 
1,697 mutation types. SigProfiler’s reference analysis was based on 
strand-agnostic SBSs in the context of one 5’ and one 3’ base. For each 
test, we generated two sets of realistic data: SigProfiler-realistic (based 
on SigProfiler’s reference signatures and attributions) and Signature- 
Analyzer-realistic (based on SignatureAnalyzer’s reference signatures 
and attributions), as well as two other types of data that involved using 
SignatureAnalyzer profiles with SigProfiler attributions and vice versa. 


A detailed description of each of the 11 sets of synthetic data and the 
results from applying SigProfiler and SignatureAnalyzer are provided 
in Supplementary Note 2. 


Analysis of clustered mutational signatures 

Somatic SBSs were considered clustered if they had intermutational 
distances <1,000 bp. More specifically, for each sample, an SBS muta- 
tional catalogue was generated for substitutions that were <1,000 bp 
from another substitution. Subsequently, the set of SBS mutational 
catalogues containing clustered mutations underwent de novo extrac- 
tion of mutational signatures. Any novel mutational signature (one 
that was not previously observed in the complete SBS catalogues) was 
reported as a clustered mutational signature. 


Better separation compared to COSMIC v.2 signatures 

As described in the manuscript, all mutational signatures previously 
reported in COSMIC v.2 were confirmed in the newset of analyses with 
median cosine similarity of 0.95. However, the separation between 
the COSMIC v.2 mutational signatures (https://cancer.sanger.ac.uk/ 
cosmic/signatures_v2) is much worse than the separation between 
the mutational signatures reported here. For example, in COSMIC v.2, 
signatures 5 and 16 had a cosine similarity of 0.90, making them hard 
to distinguish from one another. By contrast, in the current analysis, 
SBS5 and SBS16 have a cosine similarity of 0.65. This allows us to unam- 
biguously assign SBSS and SBS16 to different samples. In the current 
analysis, the larger number of samples has allowed the reduction of 
bleeding between signatures and has given more unique and easily 
distinguishable signatures. One can evaluate the overall separation of 
aset of mutational signatures by examining the distribution of cosine 
similarities between the signatures in the set. The signatures in COS- 
MIC v.2 had a median cosine similarity of 0.238. By contrast, the current 
signatures have a much lower median cosine similarity of 0.098. This 
twofold reduction in similarity is highly statistically significant (Pvalue 
9.1x 10) and indicates a better separation between the signatures in 
the current analysis. 


Correlations of mutational signature activity with age 

Before evaluating the association between age and the activity of a 
mutational signature, all outliers for both age and numbers of muta- 
tions attributed to a signature in a cancer type were removed from 
the data. An outlier was defined as any value outside three standard 
deviations from the mean value. A robust linear regression model that 
estimated the slope of the line and whether this slope was significantly 
different from zero (F test; P value < 0.05) was performed using the 
MATLAB function robustfit (https://www.mathworks.com/help/stats/ 
robustfit.html) with default parameters. The Pvalues from the F tests 
were corrected using the Benjamini-Hochberg procedure for false 
discovery rates. Results are available at syn12030687 and syn20317940. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 

Somatic and germline variant calls, mutational signatures, sub- 
clonal reconstructions, transcript abundance, splice calls and other 
core data generated by the ICGC and TCGA PCAWG Consortium are 
described in ref. *, and are available for download at https://dcc.icgc. 
org/releases/PCAWG. Additional information on accessing the data, 
including raw read files, can be found at https://docs.icgc.org/pcawg/ 
data/. In accordance with the data access policies of the ICGC and TCGA 
projects, most molecular, clinical and specimen data are in an open 
tier that does not require access approval. To access information that 
could potentially identify participants, such as germline alleles and 


the underlying sequencing data, researchers will need to apply to the 
TCGA data access committee via dbGaP (https://dbgap.ncbi.nIm.nih. 
gov/aa/wga.cgi?page=login) for access to the TCGA portion of the 
dataset, and tothe ICGC data access compliance office (http://icgc.org/ 
daco) for the ICGC portion of the dataset. In addition, to access somatic 
single nucleotide variants derived from TCGA donors, researchers 
will also need to obtain dbGaP authorization. For each mutational 
signature as extracted by SigProfiler, there is a ‘vignette’ that con- 
sists of plots and a short textual description at COSMIC (available at 
https://cancer.sanger.ac.uk/cosmic/signatures/). Beyond the core 
sequence data generated by the ICGC and TCGA PCAWG Consortium, 
other derived datasets were generated by the research reported in this 
paper. These derived datasets are available at Synapse (https://www. 
synapse.org/#!Synapse:syn11726601/wiki/513478), and are denoted by 
accession numbers (synXXXXXXXxX). All these datasets are mirrored 
at https://dcc.icgc.org/releases/PCAWG/mutational_signatures/ with 
fulllinks, filenames, accession numbers and descriptions as detailed in 
Supplementary Table 1. These datasets include (1) CSV files comprising 
all catalogues of observed mutational spectra that were used as input 
to signature extraction (syn11801889), (2) CSV files and plots of signa- 
tures extracted by SigProfiler (syn11738306) and SignatureAnalyzer 
(syn11738307), (3) CSV files with estimates of the numbers of mutations 
generated by each signature in individual tumours (syn11804065), (4) 
estimates of the probability that each signature was responsible for 
each mutational type (for example, CTG>CAG) in individual tumours 
(syn11804068) and (5) synthetic test input data plus the results of 
tests of signature extraction (discovery) on the synthetic test data 
(syn18497223). All derived datasets are open access, and can be down- 
loaded without registration or logging in. 


Code availability 


SigProfiler is available both as a MATLAB framework and as a Python 
package. In both cases, SigProfiler is a fully functional, free and open- 
source tool distributed under the permissive 2-Clause BSD License. 
SigProfiler in MATLAB can be downloaded from: https://www.math- 
works.com/matlabcentral/fileexchange/38724-sigprofiler. SigProfiler 
in Python can be downloaded from: https://github.com/Alexandrov- 
Lab/SigProfilerExtractor. SignatureAnalyzer code is available at https:// 
github.com/broadinstitute/getzlab-SignatureAnalyzer (github.com). 
The code used to generate the synthetic data and summarize Signatu- 
reAnalyzer and SigProfiler results is open source and freely available as 
the SynSig package: https://github.com/steverozen/SynSig/tree/v0.2.0 
under the GNU General Public License v.3.0. The core computational 
pipelines used by the PCAWG Consortium for alignment, quality control 
and variant calling are available to the public at https://dockstore.org/ 
search?search=pcawg under the GNU General Public License v.3.0, 
which allows for reuse and distribution. 
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Extended Data Fig. 1| Histogram of the number of signatures attributed in each of 2,780 PCAWG samples by SigProfiler and SignatureAnalyzer. 
Hypermutated tumours and melanomas (156) are listed at syn11738314. 
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Extended Data Fig. 2| Comparisons between results of SigProfiler and 
SignatureAnalyzer. a, b, Comparison of the attributions for corresponding 
SigProfiler (a) and SignatureAnalyzer (b) signatures. Each one of the SBS 
signatures extracted by SigProfiler and SignatureAnalyzer was paired with the 
signature of highest cosine similarity in the extraction by the other method (if 
one with >0.85 cosine similarity exists). The first column of the plot 
corresponds to the fraction of mutations assigned by one method (summed 
across samples and mutation types) that was also assigned by the other 
method. The remaining mutations were then redistributed to the other 
signatures in the extraction, weighted by their relative probabilities of having 
been generated by each signature and the resulting fraction of mutations was 
then plotted. Signatures on thex axis are shown only if they contribute at least a 


0.1 fraction of mutations to at least one signature on they axis. c,d, Cosine 
similarities between SigProfiler and SignatureAnalyzer DBS (c) and indel (d) 
signatures. Brown nodes represent SigProfiler signatures; green nodes 
represent SignatureAnalyzer signatures. Matches with cosine 

similarities > 0.8 are shown as edges; the width of the edge indicates the 
strength of the similarity. The locations of the nodes have no meaning. 
Signatures with no matches of >0.8 cosine similarity are shown below. 
SigProfiler ID15 and 1ID17 were extracted from data that were not analysed by 
SignatureAnalyzer. The suffix ‘P’ ona SignatureAnalyzer signature name 
indicates a signature extracted from non-hypermutated, non-melanoma 
tumours. The suffix ‘S’ ona SignatureAnalyzer signature name indicates a 
signature extracted from hypermutated or melanoma tumours. 
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Extended Data Fig. 4 | The number of SBS mutations attributed to each mutational signature for each cancer type over the PCAWG tumours by 
SignatureAnalyzer. Conventions are as in Fig. 3; see this figure for explanation. 
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Extended Data Fig. 5 | The number of SBS mutations attributed to each mutational signature to each cancer type over the complete set of PCAWG and non- 
PCAWG cancer samples analysed by SigProfiler. Conventions are as in Fig. 3; see this figure for explanation. 
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Extended Data Table 1 | The number of DBSs is proportional to the number of SBSs, with few exceptions 


Covariate (including Coefficient Coefficient Tumour 
. tvalue  Pr(> |t|) 
cancer type) estimate std. error count 

(Intercept) 5.60E+00 8.80E+01 0.1 0.9 NA 
SBS.count 3.70E-03 1.30E-04 29.8 <2e-16 NA 
Biliary-AdenoCA (reference) 35 
Bladder-TCC 1.30E+01 1.40E+02 0.1 0.9 23 
Bone-Benign -6.30E+00 1.60E+02 0 1 16 
Bone-Epith 2.20E+00 1.80E+02 0 1 11 
Bone-Osteosarc 2.20E+00 1.20E+02 0 1 38 
Breast-AdenoCA 6.20E+00 9.50E+01 0.1 0.9 198 
Breast-DCIS 4.20E+00 3.10E+02 0 1 3 
Breast-LobularCA -8.20E+00 1.70E+02 0 1 13 
Cervix-AdenoCA -7.90E+00 3.80E+02 0 1 2 
Cervix-SCC -1.10E+01 1.50E+02 -0.1 0.9 18 
CNS-GBM -2.80E+01 1.20E+02 -0.2 0.8 41 
CNS-Medullo -7.00E+00 9.80E+01 -0.1 0.9 146 
CNS-Oligo -1.00E+01 1.50E+02 -0.1 0.9 18 
CNS-PiloAstro -5.90E+00 1.00E+02 -0.1 1 89 
ColoRect-AdenoCA -4.10E+02 1.10E+02 -3.7 3.00E-04 60 
Eso-AdenoCA -1.60E+01 1.00E+02 -0.2 0.9 98 
Head-SCC 5.30E+01 1.10E+02 0.5 0.6 57 
Kidney-ChRCC -3.10E+00 1.20E+02 0 1 45 
Kidney-RCC 5.60E+01 9.80E+01 0.6 0.6 144 
Liver-HCC 7.80E+01 9.20E+01 0.8 0.4 326 
Lung-AdenoCA 5.00E+02 1.20E+02 4.1 4.00E-05 38 
Lung-SCC 5.80E+02 1.20E+02 5.1 4.00E-07 48 
Lymph-BNHL 1.00E+01 1.00E+02 0.1 0.9 107 
Lymph-CLL -4,30E+00 1.00E+02 0 1 95 
Myeloid-AML -1.90E+00 1.80E+02 0 1 11 
Myeloid-MDS -8.00E+00 2.70E+02 0 1 4 
Myeloid-MPN -7.40E+00 1.10E+02 -0.1 0.9 56 
Ovary-AdenoCA 3.60E+01 1.00E+02 0.4 0.7 113 
Panc-AdenoCA -8.30E-01 9.40E+01 0 1 241 
Panc-Endocrine -5.70E+00 1.00E+02 -0.1 1 85 
Prost-AdenoCA 2.50E+00 9.30E+01 0 1 286 
Skin-Melanoma 1.70E+03 1.00E+02 16.5 <2e-16 107 
SoftTissue-Leiomyo 6.00E+00 1.60E+02 0 1 15 
SoftTissue-Liposarc 7.80E+00 1.50E+02 0.1 1 19 
Stomach-AdenoCA -3.00E+01 1.10E+02 -0.3 0.8 75 
Thy-AdenoCA -4.80E+00 1.20E+02 0 1 48 
Uterus-AdenoCA -1.20E+02 1.10E+02 -1.1 0.3 51 


The exceptions are colorectal adenocarcinoma (Colorect-AdenoCA), lung adenocarcinoma (Lung-AdenoCA), lung squamous cell carcinoma (Lung-SCC) and skin-melanoma, as analysed by 
the following linear regression (computed by an R function call): glm(DBS.count ~ SBS.count + Cancer.Type). This function call fits a model in which the number of DBSs depends linearly on the 
number of SBSs and on the cancer type. P values associated with the coefficients are two-sided. 
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Extended Data Table 2 | Numbers of insertion and deletion mutations due to ID1, ID2 and all other indel signatures in 
hypermuted and non-hypermutated tumours 


Non- 
Hypermutators All Tumours 
hypermutators 
Signature Count Fraction Count’ Fraction Count Fraction 
ID1 593,935 0.236 399,633 0.276 993,568 0.250 


ID2 1,838,867 0.730 252,893 0.174 2,091,760 0.527 


ID1+ID2)=—s- 2,432,802 0.966 652,526 0.450 3,085,328 0.777 


Other ID 


: 85,038 0.034 797,964 0.550 883,002 0.223 
signatures 


Total 2,517,840 1 1,450,490 1 3,968,330 1 
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n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


a The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


— For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


[| For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection The data in this study were those reported in https://www.biorxiv.org/content/early/2017/07/12/162784.full.pdf+html (the PCAWG 
marker paper) and in the publications cited at https://www.synapse.org/#!Synapse:syn11801788. 


For the larger PCAWG Consortium, data and metadata were collected from International Cancer Genome Consortium (ICGC) consortium 
members using custom software packages designed by the ICGC Data Coordinating Centre. The general-purpose core libraries and 
utilities underlying this software have been released under the GPLv3 open source license as the "Overture" package and are available at 
https://www.overture.bio. Other data collection software used in this effort, such as |CGC-specific portal user interfaces, are available 
upon request to contact@overture.bio. 


Data analysis SigProfiler is available both as a MATLAB framework and as a Python package. In both cases, SigProfiler is fully functional, free, and open- 
source tool distributed under the permissive 2-Clause BSD License. SigProfiler in MATLAB can be downloaded from: https:// 
www.mathworks.com/matlabcentral/fileexchange/38724-sigprofiler SigProfiler in Python can be downloaded from: https://github.com/ 
AlexandrovLab/SigProfilerExtractor. SignatureAnalyzer code is available at https://www.synapse.org/#!Synapse:syn11801492. The code 
used to generate the synthetic data and summarize SignatureAnalyzer and SigProfiler results is open-source and freely available as the 
SynSig package: https://github.com/steverozen/ SynSig/tree/v0.2.0 under the GPL3 license. 


For the larger PCAWG Consortium, the workflows executing core WGS alignment, QC and variant-calling software are packaged as 
executable Dockstore images and available at: https://dockstore.org/search?labels.value.keyword=pcawg&searchMode=files. Individual 
software components are as follows: BWA-MEM v0.78.8-r455; DELLY v0.6.6; ACEseq v1.0.189; DKFZ somatic SNV workflow v1.0.132-1; 
Platypus v0.7.4; ascatNgs v1.5.2; BRASS v4.012; grass v1.1.6; CaVEMan v1.50; Pindel v1.5.7; ABSOLUTE/JaBbA v1.5; SVABA 2015-05-20; 
dRanger 2016-03-13; BreakPointer 2015-12-22; MuTect v1.1.4; MuSE v1.0rc; SMuFIN 2014-10-26; OxoG 2016-4-28; VAGrENT v2.1.2; 
ANNOVAR v2014Nov12; VariantBAM v2017Dec12; SNV-Merge v2017May26; SV-MERGE v2017Dec12; DKFZ v2016Dec15 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


Derived data are available at https://www.synapse.org/#!Synapse:syn11726601/wiki/513478. All figures and extended data figures have associated raw data at that 
site. 


For the larger PCAWG Consortium, WGS somatic and germline variant calls, mutational signatures, subclonal reconstructions, transcript abundance, splice calls and 
other core data generated by the ICGC/TCGA Pan-cancer Analysis of Whole Genomes Consortium are available for download at https://dcc.icgc.org/releases/ 
PCAWG. Additional information on accessing the data, including raw read files, can be found at https://docs.icgc.org/pcawg/data/. In accordance with the data 
access policies of the ICGC and TCGA projects, most molecular, clinical and specimen data are in an open tier which does not require access approval. To access 
potentially identification information, such as germline alleles and underlying sequencing data, researchers will need to apply to the TCGA Data Access Committee 
(DAC) via dbGaP (https://dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?page=login) for access to the TCGA portion of the dataset, and to the ICGC Data Access Compliance 
Office (DACO; http://icgc.org/daco) for the ICGC portion. In addition, to access somatic single nucleotide variants derived from TCGA donors, researchers will also 
need to obtain dbGaP authorization. 


Field-specific reporting 
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x] Life sciences Behavioural & social sciences [| Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size From a statistical perspective this was an exploratory study, and there were no pre-defined hypothesis tests for which sample-size power 
calculations would have been appropriate. The sample size was determined by numbers of tumour genomes and exomes represented by 
publicly available somatic mutation data. These data consisted of the ICGC Pan Cancer whole genome mutation data, the TCGA MC3 whole 
exome mutation data, and additional mutation data as described in https://www.synapse.org/#!Synapse:syn11801788. This was an 
unsupervised analysis, and therefore we extracted as many signatures as possible from all the available data. This enabled a substantial 
increment over previously available sets of mutational signatures, especially with respect to double base substitution (DBS) signatures and 
insertion/deletion (ID) signatures. 


For the larger PCAWG Consortium, the Consortium compiled an inventory of matched tumour/normal whole cancer genomes in the ICGC 
Data Coordinating Centre. Most samples came from treatment-naive, primary cancers, but there were a small number of donors with multiple 
samples of primary, metastatic and/or recurrent tumours. Our inclusion criteria were: (i) matched tumour and normal specimen pair; (ii) a 
minimal set of clinical fields; and (iii) characterisation of tumour and normal whole genomes using Illumina HiSeq paired-end sequencing 
reads. 

We collected genome data from 2,834 donors, representing all ICGC and TCGA donors that met these criteria at the time of the final data 
freeze in autumn 2014. 


Data exclusions From a statistical perspective this was an exploratory study, and there were no pre-defined hypothesis tests for which pre-defined data 
exclusion criteria would have been appropriate. Therefore, no data were excluded from analysis by our algorithms. 


For the larger PCAWG Consortium, after quality assurance, data from 176 donors were excluded as unusable. Reasons for data exclusions 
included inadequate coverage, extreme bias in coverage across the genome, evidence for contamination in samples and excessive sequencing 
errors (for example, through 8-oxoguanine). 


Replication This was not an experimental study, and there were no experimental replicates. 


For the larger PCAWG Consortium, in order to evaluate the performance of each of the mutation-calling pipelines and determine an 
integration strategy, we performed a large-scale deep sequencing validation experiment. We selected a pilot set of 63 representative tumour/ 
normal pairs, on which we ran the three core pipelines, together with a set of 10 additional somatic variant-calling pipelines contributed by 
members of the SNV Calling Working Group. Overall, the sensitivity and precision of the consensus somatic variant calls were 95% (CI90%: 
88-98%) and 95% (CI90%: 71-99%) respectively for SNVs. For somatic indels, sensitivity and precision were 60% (34-72%) and 91% (73-96%) 
respectively. Regarding SVs, we estimate the sensitivity of the merging algorithm to be 90% for true calls generated by any one caller; 
precision was estimated as 97.5% - that is, 97.5% of SVs in the merged SV call-set have an associated copy number change or balanced 
partner rearrangement. 


Randomization | There were no experimental groups in this study; the question of allocation to experimental groups is not applicable. 


For the larger PCAWG Consortium, no randomisation was performed. 
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Blinding There was no allocation to experimental groups; the question of whether investigators were blinded to allocation is not applicable. 


For larger PCAWG Consortium, no blinding was undertaken. 


Behavioural & social sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description Briefly describe the study type including whether data are quantitative, qualitative, or mixed-methods (e.g. qualitative cross-sectional, 
quantitative experimental, mixed-methods case study). 


Research sample State the research sample (e.g. Harvard university undergraduates, villagers in rural India) and provide relevant demographic information 
(e.g. age, sex) and indicate whether the sample is representative. Provide a rationale for the study sample chosen. For studies involving 
existing datasets, please describe the dataset and source. 
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Sampling strategy Describe the sampling procedure (e.g. random, snowball, stratified, convenience). Describe the statistical methods that were used to 
predetermine sample size OR if no sample-size calculation was performed, describe how sample sizes were chosen and provide a rationale 
for why these sample sizes are sufficient. For qualitative data, please indicate whether data saturation was considered, and what criteria 
were used to decide that no further sampling was needed. 


Data collection Provide details about the data collection procedure, including the instruments or devices used to record the data (e.g. pen and paper, 
computer, eye tracker, video or audio equipment) whether anyone was present besides the participant(s) and the researcher, and whether 
the researcher was blind to experimental condition and/or the study hypothesis during data collection. 


Timing Indicate the start and stop dates of data collection. If there is a gap between collection periods, state the dates for each sample cohort. 


Data exclusions If no data were excluded from the analyses, state so OR if data were excluded, provide the exact number of exclusions and the rationale 
behind them, indicating whether exclusion criteria were pre-established. 


Non-participation State how many participants dropped out/declined participation and the reason(s) given OR provide response rate OR state that no 
participants dropped out/declined participation. 


Randomization If participants were not allocated into experimental groups, state so OR describe how participants were allocated to groups, and if 
allocation was not random, describe how covariates were controlled. 


Ecological, evolutionary & environmental sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description Briefly describe the study. For quantitative data include treatment factors and interactions, design structure (e.g. factorial, nested, 
hierarchical), nature and number of experimental units and replicates. 


Research sample Describe the research sample (e.g. a group of tagged Passer domesticus, all Stenocereus thurberi within Organ Pipe Cactus National 
Monument), and provide a rationale for the sample choice. When relevant, describe the organism taxa, source, sex, age range and 
any manipulations. State what population the sample is meant to represent when applicable. For studies involving existing datasets, 
describe the data and its source. 


Sampling strategy Note the sampling procedure. Describe the statistical methods that were used to predetermine sample size OR if no sample-size 
calculation was performed, describe how sample sizes were chosen and provide a rationale for why these sample sizes are sufficient. 


Data collection Describe the data collection procedure, including who recorded the data and how. 
Timing and spatial scale | /ndicate the start and stop dates of data collection, noting the frequency and periodicity of sampling and providing a rationale for 
these choices. If there is a gap between collection periods, state the dates for each sample cohort. Specify the spatial scale from which 


the data are taken 


Data exclusions If no data were excluded from the analyses, state so OR if data were excluded, describe the exclusions and the rationale behind them, 
indicating whether exclusion criteria were pre-established. 


Reproducibility Describe the measures taken to verify the reproducibility of experimental findings. For each experiment, note whether any attempts to 
repeat the experiment failed OR state that all attempts to repeat the experiment were successful. 


Randomization Describe how samples/organisms/participants were allocated into groups. If allocation was not random, describe how covariates were 
controlled. If this is not relevant to your study, explain why. 


Blinding Describe the extent of blinding used during data acquisition and analysis. If blinding was not possible, describe why OR explain why 
blinding was not relevant to your study. 


Did the study involve field work? Yes [| No 


Field work, collection and transport 


Field conditions Describe the study conditions for field work, providing relevant parameters (e.g. temperature, rainfall). 
Location State the location of the sampling or experiment, providing relevant parameters (e.g. latitude and longitude, elevation, water 
depth). 


Access and import/export Describe the efforts you have made to access habitats and to collect and import/export your samples in a responsible manner and 
in compliance with local, national and international laws, noting any permits that were obtained (give the name of the issuing 
authority, the date of issue, and any identifying information). 


Disturbance Describe any disturbance caused by the study and how it was minimized. 
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Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
|__| Eukaryotic cell lines |__| Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used Describe all antibodies used in the study; as applicable, provide supplier name, catalog number, clone name, and lot number. 


Validation Describe the validation of each primary antibody for the species and application, noting any validation statements on the 
manufacturer’s website, relevant citations, antibody profiles in online databases, or data provided in the manuscript. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) State the source of each cell line used. 
Authentication Describe the authentication procedures for each cell line used OR declare that none of the cell lines used were authenticated. 
Mycoplasma contamination Confirm that all cell lines tested negative for mycoplasma contamination OR describe the results of the testing for 


mycoplasma contamination OR declare that the cell lines were not tested for mycoplasma contamination. 


Commonly misidentified lines ame any commonly misidentified cell lines used in the study and provide a rationale for their use. 
(See ICLAC register) 


Palaeontology 


Specimen provenance Provide provenance information for specimens and describe permits that were obtained for the work (including the name of the 
issuing authority, the date of issue, and any identifying information). 


Specimen deposition Indicate where the specimens have been deposited to permit free access by other researchers. 
Dating methods If new dates are provided, describe how they were obtained (e.g. collection, storage, sample pretreatment and measurement), 


where they were obtained (i.e. lab name), the calibration program and the protocol for quality assurance OR state that no new 
dates are provided. 


Tick this box to confirm that the raw and calibrated dates are available in the paper or in Supplementary Information. 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals For laboratory animals, report species, strain, sex and age OR state that the study did not involve laboratory animals. 


Wild animals Provide details on animals observed in or captured in the field; report species, sex and age where possible. Describe how animals 
were caught and transported and what happened to captive animals after the study (if killed, explain why and describe method; if 
released, say where and when) OR state that the study did not involve wild animals. 


Field-collected samples For laboratory work with field-collected samples, describe all relevant parameters such as housing, maintenance, temperature, 
photoperiod and end-of-experiment protocol OR state that the study did not involve samples collected from the field. 


Ethics oversight Identify the organization(s) that approved or provided guidance on the study protocol, OR state that no ethical approval or 
guidance was required and explain why not. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Human research participants 


Policy information about studies involving human research participants 


Population characteristics For the PCAWG Consortium data, patient-by-patient clinical data are provided in the marker paper for the PCAWG consortium 
(Extended Data Table 1 of that manuscript). Demographically, the cohort included 1,469 males (55%) and 1,189 females (45%), 
with a mean age of 56 years (range, 1-90 years). Using population ancestry-differentiated single nucleotide polymorphisms 
(SNPs), the ancestry distribution was heavily weighted towards donors of European descent (77% of total) followed by East 
Asians (16%), as expected for large contributions from European, North American and Australian projects. We consolidated 
histopathology descriptions of the tumour samples, using the ICD-O-3 tumour site controlled vocabulary. Overall, the PCAWG 
data set comprises 38 distinct tumour types. While the most common tumour types are included in the dataset, their 
distribution does not match the relative population incidences, largely due to differences among contributing ICGC/TCGA groups 
in numbers sequenced. The non-PCAWG analyses used previously published data. 


Recruitment For the PCAWG Consortium data, patients were recruited by the participating centres following local protocols. 
Ethics oversight For the PCAWG Consortium data, the Ethics oversight for the PCAWG protocol was undertaken by the TCGA Program Office and 


the Ethics and Governance Committee of the ICGC. Each individual ICGC and TCGA project that contributed data to PCAWG had 
their own local arrangements for ethics oversight and regulatory alignment. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Clinical data 


Policy information about clinical studies 
All manuscripts should comply with the ICMJE guidelines for publication of clinical research and a completed CONSORT checklist must be included with all submissions. 


Clinical trial registration Provide the trial registration number from ClinicalTrials.gov or an equivalent agency. 

Study protocol Note where the full trial protocol can be accessed OR if not available, explain why. 

Data collection Describe the settings and locales of data collection, noting the time periods of recruitment and data collection. 
Outcomes Describe how you pre-defined primary and secondary outcome measures and how you assessed these measures. 


ChIP-seq 


Data deposition 


Confirm that both raw and final processed data have been deposited in a public database such as GEO. 


Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks. 


Data access links For "Initial submission" or "Revised version" documents, provide reviewer access links. For your "Final submission" document, 
May remain private before publication. provide a link to the deposited data. 

Files in database submission Provide a list of all files available in the database submission. 

Genome browser session Provide a link to an anonymized genome browser session for "Initial submission" and "Revised version" documents only, to 


(e.g. UCSC) enable peer review. Write "no longer applicable" for "Final submission" documents. 


Methodology 


Replicates Describe the experimental replicates, specifying number, type and replicate agreement. 


Sequencing depth Describe the sequencing depth for each experiment, providing the total number of reads, uniquely mapped reads, length of 
reads and whether they were paired- or single-end. 


Antibodies Describe the antibodies used for the ChIP-seq experiments; as applicable, provide supplier name, catalog number, clone 
name, and lot number. 


Peak calling parameters Specify the command line program and parameters used for read mapping and peak calling, including the ChIP, control and 
index files used. 


Data quality Describe the methods used to ensure data quality in full detail, including how many peaks are at FDR 5% and above 5-fold 
enrichment. 
Software Describe the software used to collect and analyze the ChIP-seq data. For custom code that has been deposited into a 
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community repository, provide accession details. 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a ‘group’ is an analysis of identical markers). 


| _|All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 


Sample preparation Describe the sample preparation, detailing the biological source of the cells and any tissue processing steps used. 
Instrument Identify the instrument used for data collection, specifying make and model number. 
Software Describe the software used to collect and analyze the flow cytometry data. For custom code that has been deposited into a 


community repository, provide accession details. 


Cell population abundance __| Describe the abundance of the relevant cell populations within post-sort fractions, providing details on the purity of the samples 
and how it was determined. 


Gating strategy Describe the gating strategy used for all relevant experiments, specifying the preliminary FSC/SSC gates of the starting cell 
population, indicating where boundaries between "positive" and "negative" staining cell populations are defined. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 


Magnetic resonance imaging 


Experimental design 


Design type Indicate task or resting state; event-related or block design. 


Design specifications Specify the number of blocks, trials or experimental units per session and/or subject, and specify the length of each trial 
or block (if trials are blocked) and interval between trials. 


Behavioral performance measures _| State number and/or type of variables recorded (e.g. correct button press, response time) and what statistics were used 
to establish that the subjects were performing the task as expected (e.g. mean, range, and/or standard deviation across 
subjects). 


Acquisition 
Imaging type(s) 
Field strength 


Sequence & imaging parameters 


Area of acquisition 


Diffusion MRI Used 


Preprocessing 


Preprocessing software 


Normalization 


Normalization template 


Noise and artifact removal 


Volume censoring 


Statistical modeling & inference 


Model type and settings 


Effect(s) tested 


Specify: functional, structural, diffusion, perfusion. 
Specify in Tesla 


Specify the pulse sequence type (gradient echo, spin echo, etc.), imaging type (EPI, spiral, etc.), field of view, matrix size, 
slice thickness, orientation and TE/TR/flip angle. 


State whether a whole brain scan was used OR define the area of acquisition, describing how the region was determined. 


Not used 


Provide detail on software version and revision number and on specific parameters (model/functions, brain extraction, 
segmentation, smoothing kernel size, etc.). 


If data were normalized/standardized, describe the approach(es): specify linear or non-linear and define image types 
used for transformation OR indicate that data were not normalized and explain rationale for lack of normalization. 


Describe the template used for normalization/transformation, specifying subject space or group standardized space (e.g. 
original Talairach, MNI305, ICBM152) OR indicate that the data were not normalized. 


Describe your procedure(s) for artifact and structured noise removal, specifying motion parameters, tissue signals and 
physiological signals (heart rate, respiration). 


Define your software and/or method and criteria for volume censoring, and state the extent of such censoring. 


Specify type (mass univariate, multivariate, RSA, predictive, etc.) and describe essential details of the model at the first 
and second levels (e.g. fixed, random or mixed effects; drift or auto-correlation). 


Define precise effect in terms of the task or stimulus conditions instead of psychological concepts and indicate whether 
ANOVA or factorial designs were used. 


Specify type of analysis: Whole brain ROI-based [J Both 


Statistic type for inference 
(See Eklund et al. 2016) 


Correction 


Models & analysis 


n/a | Involved in the study 


Graph analysis 


Specify voxel-wise or cluster-wise and report all relevant parameters for cluster-wise methods. 


Describe the type of correction and how it is obtained for multiple comparisons (e.g. FWE, FDR, permutation or Monte 
Carlo). 


Functional and/or effective connectivity 


Multivariate modeling or predictive analysis 


Functional and/or effective connectivity Report the measures of dependence used and the model details (e.g. Pearson correlation, partial 


Graph analysis 


correlation, mutual information). 


Report the dependent variable and connectivity measure, specifying weighted graph or binarized graph, 
subject- or group-level, and the global and/or node summaries used (e.g. clustering coefficient, efficiency, 
etc.). 


Multivariate modeling and predictive analysis Specify independent variables, features extraction and dimension reduction, model, training and evaluation 


metrics. 
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The discovery of drivers of cancer has traditionally focused on protein-coding 
genes’ *. Here we present analyses of driver point mutations and structural variants in 
non-coding regions across 2,658 genomes from the Pan-Cancer Analysis of Whole 
Genomes (PCAWG) Consortium of the International Cancer Genome Consortium 
(ICGC) and The Cancer Genome Atlas (TCGA). For point mutations, we developed a 
statistically rigorous strategy for combining significance levels from multiple 
methods of driver discovery that overcomes the limitations of individual methods. 
For structural variants, we present two methods of driver discovery, and identify 
regions that are significantly affected by recurrent breakpoints and recurrent somatic 
juxtapositions. Our analyses confirm previously reported drivers®’, raise doubts 
about others and identify novel candidates, including point mutations in the 5’ region 
of TP53, inthe 3’ untranslated regions of NFKB/Zand TOBL, focal deletions in BRD4 and 
rearrangements in the loci of AKRIC genes. We show that although point mutations 
and structural variants that drive cancer are less frequent in non-coding genes and 
regulatory sequences than in protein-coding genes, additional examples of these 
drivers will be found as more cancer genomes become available. 


Previous large-scale sequencing projects have identified many putative 
cancer genes, but most efforts have concentrated on mutations and 
copy-number alterations in protein-coding genes, mainly using whole- 
exome sequencing and single-nucleotide polymorphism arrays! +. 
Whole-genome sequencing has made it possible to systematically 
survey non-coding regions for potential driver events, including 


single-nucleotide variants (SNVs), small insertions and deletions 
(indels) and larger structural variants. Whole-genome sequencing 
enables the precise localization of structural variant breakpoints and 
connections between distinct genomic loci (juxtapositions). Although 
previous whole-genome sequencing analyses of modestly sized cohorts 


have revealed candidate non-coding regulatory driver events®*”, 


The list of affiliations appears at the end of the paper. 
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the frequency and functional implications of these events remain 
understudied®”"°”, 

Driver identification remains a far greater challenge in non-coding 
regions than in coding genes, owing to sequencing and mapping arte- 
facts, poorly understood localized hypermutation processes*7*””, 
incomplete annotation of regulatory regions, inaccurate estimation 
of the background mutation rate and the unknown functional effect 
of non-coding mutations. The discovery of drivers from structural 
variants is further complicated by their sparsity, the lack of obvious 
neutral events to build background models and their complex func- 
tional effects. Adequate statistical methods that address these issues 
are needed to reliably identify non-coding drivers. 

The ICGC and TCGA PCAWG effort, which has collected and system- 
atically analysed cancer genome sequences from 2,658 patients across 
38 types of cancer’, offers an opportunity to characterize putative 
non-coding driver events that cannot be found using data from whole- 
exome sequencing or single-nucleotide polymorphism arrays. Here we 
describe acomprehensive search for non-coding somatic drivers. For 
point mutations (SNVs and indels), we combine results from multiple 
driver-discovery algorithms and, by carefully evaluating the signifi- 
cant hits, reveal that recurrent artefacts and poorly understood muta- 
tional processes have led to common false positives among previously 
reported non-coding drivers. For structural variants, we introduce two 
new methods for identifying both regions with significantly recurrent 
breakpoints (SRBs) and with significantly recurrent juxtapositions 
(SRJs), accounting for genomic heterogeneity in the rates of DNA break 
and repair and the three-dimensional architecture of the genome. 
Finally, to assess the potential for future non-coding driver discoveries, 
we quantify our statistical power in the PCAWG dataset and estimate 
the overall excess of point mutations in non-coding regulatory regions 
around known cancer genes. 


Hotspot mutations across cancer types 

Many protein-coding driver mutations occur in single-site ‘hotspots’. In 
the PCAWG dataset, only 12 single-nucleotide positions were mutated 
in>1%, and 106 in>0.5%, of patients (Extended Data Fig. la, Methods). 
Although protein-coding regions span only about 1% of the genome, 15 
out of 50 (30%) of the most frequently mutated sites were well-studied 
hotspots in cancer genes (KRAS, BRAF, PIK3CA, TP53 and IDHI1) (Fig. 1a, 
Extended Data Fig. 1b), along with the two canonical TERT promoter 
hotspots°’. 

The remaining non-coding hotspots could be attributed to the 
following localized mutational processes associated with passenger 
events: (i) damage from ultraviolet (UV) light and impaired nucleotide 
excision repair in melanoma at sites occupied by transcription fac- 
tors*"8°: (ii) somatic hypermutation by activation-induced cytosine 
deaminase (AID) in B-cell non-Hodgkin lymphoma (Lymph-BNHL) 
and chronic lymphocytic leukaemia (Lymph-CLL); (iii) palindromic 
sequence contexts believed to form hairpin DNA structures targeted 
by APOBEC enzymes (in an intron of GPR126 (also known as ADGRG6) 
and the PLEKHS1 promoter)"; and (iv) presumed technical artefacts 
(Fig. 1a, Supplementary Note 1). These findings suggest that—besides 
TERT promoter events—non-coding single-site hotspot drivers are 
infrequent or fall in regions with low sensitivity to detect mutations. 


Discovery of point-mutation drivers 

To identify recurrently mutated genomic elements, we first analysed 
somatic SNVs and indels in protein-coding regions, RNA genes (long 
and short non-coding RNAs and microRNAs (miRNAs)), and regula- 
tory regions (promoters, 5’ untranslated regions (UTRs), 3’ UTRs and 
enhancers), totalling about 4% of the genome (Extended Data Fig. 2a-c, 
Methods, Supplementary Table 1). We analysed 2,583 tumours from 
27 individual tumour types, and 15 meta-cohorts that grouped cancers 


by tissue of origin or organ system (Extended Data Fig. 2d, Methods). 
We identified candidate drivers—that is, cohort-element combinations 
with Q<0.1(10% false discovery rate (FDR))—by integrating 13 discov- 
ery algorithms, circumventing biases introduced by any one method 
(Extended Data Figs. 2e, 11, Supplementary Tables 2, 3, Supplementary 
Note 2). We benchmarked this approach by evaluating its ability to 
detect 603 knowncancer genes (from the Cancer Gene Census (CGC)”, 
v.80), and found that combining methods improved performance 
compared to single algorithms (Extended Data Fig. 3a, b, Methods). 
Overall, we identified 1,294 significant hits that involved 520 unique 
candidates (Supplementary Tables 4, 5). 


Filtering the significant hits 


Even after conservative FDR control, false-positive ‘driver’ loci can 
remain, owing to inaccurate background models, sequencing and map- 
ping artefacts, or local increases in mutations due to unaccounted-for 
mutational processes. We therefore systematically filtered the candi- 
date driver elements on the basis of technical and biological criteria, 
followed by careful review (Extended Data Fig. 3c, Methods, Supple- 
mentary Note 3). Examples of filtered elements include the promot- 
ers of P/M1 (lymphoid tumours) and RPL13A (melanoma) because of 
associations with localized AID and UV-light mutational processes, 
respectively; PLEKHS1, GPR126, TBCID12 and LEPROTL1 because of 
palindromic APOBEC target sequences”””; and the WDR745’ UTR and 
promoter®”°“, owing to mapping problems detected in downstream 
manual review (Supplementary Table 5, Supplementary Note 4). In 
combination, filtering and reapplying FDR control discarded 589 out of 
1,294 (46%) of the original cohort-element hits and 341 out of 520 (66%) 
unique elements (Extended Data Fig. 3c, Supplementary Tables 4, 5). 


Candidate coding and non-coding drivers 


Our stringent combination and filtering strategy yielded 705 hits 
in 179 genomic elements: 602 hits in 143 protein-coding genes and 
103 hits in non-coding elements. We observed wide variability across 
different types of cancer, from one hit in clear-cell renal cancer to 
80 inthe pan-cancer meta-cohort (Fig. 1b, Supplementary Tables 4,5). 
Although most candidate drivers gained significance in larger meta- 
cohorts, some genes—such as DAXX (pancreatic endocrine tumour), 
NRAS (melanoma), SPOP (prostate adenocarcinoma), FGFRI1 (pilocytic 
astrocytoma) and M/R142 (Lymph-BNHL)-scored higher in individual 
tumour types (Extended Data Fig. 3d). These results emphasize the 
trade-off between limiting driver discovery analyses to particular types 
of tumour and maximizing cohort size. 

The candidate coding drivers we identified agreed with previous 
results: of the 143 genes that were significant in at least 1 cohort, 69% 
are in the CGC and nearly all have previously been implicated in can- 
cer. In contrast to large whole-exome sequencing datasets, the fewer 
patients per cancer type inthis dataset provided power sufficient only 
to detect genes with the strongest signal. We found 116 additional hits 
in 84 unique elements that were ‘near significance’ (0.1< Q< 0.25). 
Fifty-one per cent of the 63 unique protein-coding genes in this set 
are in the CGC, which suggests that they would have been discovered 
in larger cohorts (Supplementary Table 4). 

Tonominatea significant non-coding element as a candidate driver, 
we reviewed the supporting evidence from the mutation calls, addi- 
tional genomic data (chromosomal breakpoints, copy number, loss- 
of-heterozygosity and expression data), cancer gene databases and the 
literature (Methods, Supplementary Tables 6, 10). We describe the key 
candidates below, and in Supplementary Note 4. 

The TERT promoter was the most frequently mutated non-coding 
driver in this dataset (14 cohorts) (Fig. 1b), and these mutations were 
strongly associated with higher TERT expression, as has previously 
been reported? (Extended Data Fig. 4a, Supplementary Table 10). 
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Fig. 1| Non-coding point mutations in PCAWG. a, The bar chart (left) shows 
the total number of patients across PCAWG with mutations at a particular 
genomic hotspot (chromosome:position). The top 25 hotspots are grouped as 
known drivers or induced by mutational processes. The table (middle) shows 
the frequency of mutations across a subset of PCAWG cohorts. Lymphoid 
malignancies comprise Lymph-BNHL and Lymph-CLL. The stacked bar chart 
(right) shows the contribution of mutational processes to the hotspot 
mutations (Methods). Gene names are given when hotspots overlap functional 
elements (colour-coded), with amino acid (AA) alterations for protein-coding 
genes (solidus denotes substitution with any one of the indicated amino acids). 
Extended Data Fig. 1b shows the top 50 hotspots, and all cohorts. b, Significant 
non-coding elements (Q< 0.1 of Brown’s combined Pvalues of up to 13 driver 


Mutations in the promoter and/or 5’ UTR of MTG2 (which encodes 
a GTPase involved in the mitochondrial ribosome) were associated 
with an expression of M7G2that was marginally significantly lower, in 
both the pan-cancer (P= 0.036, fold difference = 0.8) and carcinoma 
(P=0.029, fold difference = 0.8) meta-cohorts (Extended Data Figs. 4a, 
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discovery methods; Methods) identified before manual review in cohorts with 
at least one hit. Colour represents significance levels. Details are provided in 
Supplementary Table 5. *Potential technical artefact; #targets affected by 
mutational processes. AdenoCA, adenocarcinoma; CNS, central nervous 
system; Eso, oesophageal; GBM, glioblastoma; HCC, hepatocellular carcinoma; 
Medullo, medulloblastoma; Panc, pancreatic; Prost, prostate; RCC, renal cell 
carcinoma; Repr., reproductive organs; SCC, squamous cell carcinoma; TCC, 
transitional cell carcinoma; Thy, thyroid. HISTIH2AM is also known as H2ACI7; 
Ala.TGC as TRA-TGC3-1; Met.CAT as TRM-CATI-1; and Gly.GCC as TRG-GCC2-3. 
PTDSS1/MTERF3 denotes that 5’ UTR mutations in PTDSS1 also overlap the 
MTERF3 promoter. 


5a). Mutations in the 5’ UTR have previously been shown to decrease 
MTG2 expression in vitro” 

Recurrent somatic events were identified in the 3’ UTRs of TOB1 
(carcinoma and pan-cancer meta-cohorts), NFKBIZ (lymphomas) and 
ALB (liver cancer) (Fig. 1b). TOB1 encodes an anti-proliferation regulator 
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Fig. 2| Newly identified non-coding driver candidates and localized 
transcription-associated mutational process. a, Recurrent mutations and 
associated gene expression in the highly conserved 7OB13’ UTR. Tracks 
showing conservation score (PhyloP, grey), miRNA-binding sites (TargetScan 
(top track) and Ago-Clip (bottom track)), and observed SNVs (blue) and indels 
(green). Expression of TOB1 in mutated (n= 13) and wild-type (n= 886) cases 
(right). Pvalue based on two-sided Wilcoxon rank-sum test. Bars represent 
means. CNA, copy-number alteration. b, Indels and SNVs overlapping the 7P53 
5’ region and their effect on gene expression. H3K4me3 from the GM12878 cell 
line (ENCODE). Event numbers match with gene expression in the right panel 
(red dot, mutated sample; black bar, median). Pvalue represents Fisher’s 
combination of permutation tests within each tumour type. ChRCC, 
chromophobe renal cell carcinoma; FPKM, fragments per kilobase of 


that associates with ERBB2, and also affects migration and invasion in 
gastric cancer”. TOB1 regulates other mRNAs through binding to their 
3’UTRand promoting deadenylation™. Tumours with 3’ UTR mutations 
in TOB1 showed atrend towards decreased expression (P= 0.033, fold 
difference =0.7). The mutations did not concentrate in known miRNA- 
binding sites; however, the region is extremely conserved and thus 
probably functional (Fig. 2a). TOB1 and its neighbouring gene WFIKKN2 
are focally amplified in breast cancer and pan-cancer, suggesting a 
complex role in cancer (Extended Data Fig. 4b). NFKB/Zis a transcrip- 
tion factor that is mutated in diffuse large B cell lymphoma and ampli- 
fied in primary lymphomas”. Mutations in the 3’ UTR accumulated 
in a hotspot proximal to the stop codon and upstream of conserved 
miRNA-binding sites (Extended Data Fig. Sb). The enrichment of indels 
next to the stop codon suggests that this hotspot is not due to AID off- 
target activity. Previous functional experiments have associated these 
mutations with increased NFKBIZ expression”, which we observed in 
our lymphoma cohort (P= 0.035, fold difference = 3.2; after correction 
for copy number, P= 0.03) (Extended Data Fig. 5b). 

Boththe exon and promoter of the non-coding RNA RMRP were sig- 
nificantly mutated in multiple types of cancer (Fig. 1b, Extended Data 
Fig. Sc). Germline RMRP mutations cause cartilage—-hair hypoplasia, 
and previous in vitro studies have shown that some somatic promoter 
mutations are functional’®. The RMRP locus is also focally amplified in 
several types of tumour (Extended Data Fig. 4b). The enrichment of 
mutations in sites that can affect secondary structure suggests that 
these mutations are functional (P= 0.011, permutation test) (Extended 


transcript per million mapped reads. c, Overall pan-cancer distribution of 
indels and SNVs in ALB, NEAT1 and MALAT1 genomic loci (lymphoid tumour 
samples were excluded owing to AID). d, Quantification of average indel rates 
for genes with significantly mutated 3’ UTRs. Error bars represent 95% binomial 
confidence intervals. e, Contribution of indels of different sizes in: all protein- 
coding and long non-coding RNA genes; ALB; NEATI; MALATI; MIR122; and the 
remaining genes enriched in 2-5-bp indels. f, SNV and indel rates (total events 
per Mb per patient) in different functional regions of 18 protein-coding genes 
enriched in 2-5-bp indels (without ALB, which contributed 47% of indels). Red 
lines indicate background indel and SNV rates estimated from all protein- 
coding genes. Error bars as ind; raw counts provided in Supplementary 

Table 18. c-f, Mutations analysed in all unique cases (n=2,583). 


Data Fig. 5c), although caution is required because this locus also 
appears to be affected by mapping artefacts or increased mutation 
rates (Supplementary Note 4). 

The miR-142 precursor miRNA was significant in Lymph-BNHL and 
the lymphatic and haematopoietic cohorts (Fig. 1b; Extended Data 
Fig. 5d). The locus is a known AID off-target region in lymphoma””*, 
but 7 out of 8 mutations in the mature miRNA mir-142-p3-—-for which 
the largest functional effect is expected-—-were not assigned to AID, 
which suggests that these mutations are under selection”. 


Unbiased genome-wide driver screen 


To test whether we missed drivers by focusing on functionally anno- 
tated regions, we applied an unbiased genome-wide survey to all non- 
overlapping 2-kb windows for excess point mutations. Twenty-two of 
the resulting 67 significant windows overlap with known protein-coding 
drivers, and 28 overlap highly transcribed regions with an excess of 
2-5-bp indels (described in the ‘Transcription-associated indel signa- 
ture’ section below) (Extended Data Fig. 5e, Supplementary Table 9, 
Supplementary Note 5). The remaining 17 windows have no obvious 
link to cancer, and several appear to be affected by mapping artefacts. 
Aseparate analysis of 4,351 ultra-conserved non-coding regions did not 
yield new candidate drivers (Extended Data Fig. 5e, Supplementary 
Note 5). Both screens suggest that the paucity of non-coding point- 
mutation drivers found in this study is not due to the annotation of 
functional elements. 
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Increasing power for known cancer genes 


Finally, we performed restricted hypothesis testing to boost the sta- 
tistical power to detect cis-regulatory driver mutations near cancer 
genes from the CGC” (Supplementary Table 7). Restricted hypothesis 
testing of cancer gene promoters revealed a significant recurrence of 
TP53 promoter mutations (11 patients in pan-cancer, Q=0.044), mostly 
comprising SNVs and deletions that affect the transcription start site 
or donor splice site of the first non-coding exon. In 10 out of 11 cases, 
the mutation occurred in combination with loss-of-heterozygosity, 
and all samples with expression data showed decreased mRNA levels 
(Fig. 2b). None of these patients contained additional coding mutations 
that could instead be responsible for the downregulation of TP53. To 
our knowledge, this is the first report of a relatively infrequent—but 
impactful—form of TP53 inactivation by non-coding mutations. 

Focal gains or losses in cancer are selected for modulating expres- 
sion levels of their target genes. Restricting the hypothesis testing to 
the non-coding elements of such genes (n = 216,986 cohort-element 
combinations, representing 5,201 unique elements) (Methods) yielded 
only one new hit, the 3’ UTR of the oncogene FOXA1 in prostate can- 
cer (Supplementary Table 11). 


Transcription-associated indel signature 


Several significant non-coding elements (the ALB 3’ UTR, NEAT1, 
MALAT1 and MIR122) were hit by many indels; all have previously 
been reported to be mutated in cancer!” (Figs. 1b, 2c). To explore 
whether ALB 3’ UTR events are under selection, we calculated indel rates 
across the functional regions of this gene. The indel rate is notably high 
throughout the UTRs, introns and exons, and even downstream of the 
polyadenylation site—a pattern inconsistent with selection (Fig. 2c, d). 
Similarly, FOXA/ has high indel rates throughout its locus, whereas the 
indels in NFKBIZ and TOB1/ are in their 3’ UTRs, suggesting that these 
are driver events (Fig. 2d). ALB, NEAT1 and MALATI mutations were not 
associated with changes in gene expression (Extended Data Fig. 4a) 
and were not associated with high cancer cell fractions or biallelic loss 
(Extended Data Fig. 6a, b). Likewise, indels in M/R122 were downstream 
of the mature miRNA, and were not associated with altered expression 
of the targets of this miRNA (Supplementary Note 5). 

If the indels in these genes were due to a mutational process rather 
than selection, they might exhibit distinct features. Indeed, indels in 
NEAT1, MALATI, MIR122 and ALB were strongly enriched in 2—5-bp-long 
events (Fisher’s P< 6.8 x 10°, for all) (Fig. 2e). A systematic search of 
coding and non-coding genes with significantly (Q < 0.1) increased 
rates of 2-5-bp indels revealed that this mutational process affects at 
least 18 additional genes in different types of tumour, most of whichare 
highly expressed and tissue-specific (as has previously been reported 
for some of these genes”) (Extended Data Fig. 6e, f). Although less 
enriched, SNVs also occur at high frequencies in these regions (Fig. 2f). 
Overall, our findings suggest that the indels in MALAT1, NEAT1, ALB and 
MIR122are not driver events and are the result of atranscription-asso- 
ciated mutational process. The previously reported oncogenic effect 
of altered MALAT1 and NEATI expression” ”’ may thus be unrelated 
to these mutations. Our findings also suggest that although FOXA1 
protein-coding indels are drivers, 3’ UTR indels might be passengers”. 


Breakpoints at driver and fragile sites 


Driver structural variants may act by disrupting one or both of their 
breakpoint loci (for example, deactivating a tumour suppressor), or 
by generating a novel juxtaposition between loci. We thus searched 
both for genomic regions with SRBs and for pairs of regions with SRJs 
(Extended Data Fig. 7). 

For SRBs, we first defined a background model to predict breakpoint 
density, using eight explanatory variables (Methods, Supplementary 
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Table 13) and accounting for unexplained sources of variation” (Sup- 
plementary Note 6). We identified 53 disjoint regions with SRBs (Q< 0.1) 
(Fig.3a, Supplementary Table 14), whichcleanly divided into two groups 
onthe basis of the variability of the breakpoints at the other side of the 
rearrangements. Eight SRBs had partner breakpoints that were tightly 
clustered (had low rearrangement dispersion scores; Methods) and 
represented known oncogenic fusions. The remaining 45 SRBs had 
dispersed partner breakpoints (had high rearrangement dispersion 
scores), and were largely associated with previously identified somatic 
copy-number alterations (SCNAs) (Fig. 3b). 

It has been difficult to distinguish recurrent driver SCNAs from 
passenger events at fragile sites*. At the resolution afforded by 
whole-genome sequencing, late replication timing predicted fra- 
gility-associated SRBs better than existing fragile site annotations 
(Supplementary Note 7), identifying 12 fragile-like SRBs (Fig. 3b). The 
remaining 33 SCNA-like SRBs comprised 14 amplifications, 8 deletions 
and 11 copy-neutral events (Supplementary Table 14). 

The different classes of SRB were associated with different effects 
on neighbouring genes. Five of the eight deletion-associated SRBs 
were associated with biallelic inactivation of nearby known tumour 
suppressors, compared to none of the 12 fragile-like SRBs (P= 0.039) 
(Extended Data Fig. 8a). The fragile-like SRBs were furthest from tissue- 
matched enhancers and caused the weakest expression changes, con- 
sistent with them being passenger events”. By contrast, fusion-like SRBs 
were closer to tissue-matched enhancers than the other SRBs (P< 0.01) 
(Extended Data Fig. 8b) and were associated with greater changes in 
expression than all other SRBs except amplifications (P< 0.05 for all 
types) (Extended Data Fig. 8c, Methods). Our analyses indicate that 
SRB driver events can be classified using rearrangement dispersion 
scores, replication timing and gene expression. Notably, neither rear- 
rangement dispersion scores nor association with replication time can 
be accurately determined from microarrays or whole-exome sequenc- 
ing, which highlights the importance of whole-genome sequencing. 
Altogether, we identified SRBs at 34 sites of known oncogenic fusions 
and recurrent SCNAs, 5 additional sites that are probably due to DNA 
fragility and 14 novel driver candidates (Supplementary Note 8). 


Novel structural-variant driver candidates 


Although most SCNA-like SRBs act by altering gene copy numbers, 
several appeared to target regulatory elements. We identified three 
that were significantly (Q< 0.05) associated with expression changes 
of nearby genes after controlling for copy number (Methods), two of 
which we discuss here. The first comprised structural variants at 10p15, 
which were associated with a greater than twofold upregulation of 
AKRIC1, AKR1C2and AKRIC3 in seven cases of lung squamous cell car- 
cinoma and twocases of liver hepatocellular carcinoma (Extended Data 
Fig. 8d). AKRIC proteins are aldo-keto reductases involved in steroid 
homeostasis. Ectopic expression transforms cell lines, and germline 
mutations have previously been linked to an increased risk of develop- 
ing lung cancer****, Three-quarters of the breakpoints are near (<10 kb) 
lineage-specific enhancers, potentially altering promoter-enhancer 
interactions (and hence gene expression). However, because the high- 
est density of breakpoints lies between two long inverted repeats, the 
structural variants may have been induced by DNA secondary structure. 

The second SRB contains recurrent microdeletions (<50 kb) involving 
the 5’ end of BRD4 in ovarian (eight cases, P< 107’) and breast tumours 
(six cases, P< 0.04) (Fig. 3c, Extended Data Fig. 8e). These deletions 
were highly enriched in cancers that amplified a segment that includes 
BRD4 and NOTCH3(P<0.004) (Fig. 3d, Extended Data Fig. 8f) but were 
not a direct consequence of these amplifications (Supplementary 
Note 9). BRD4is achromatin regulator and a therapeutic target in sev- 
eral types of cancer®*”®, including ovarian and triple-negative breast 
cancer®”*’, Given the increased copy number of the full BRD4 gene, we 
would expect increased gene expression. However, the microdeletions 
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Fig. 3 | Significantly recurrent breakpoints and juxtapositions. a, Relative 
enrichment (Fisher’s exact test) for events per tumour type for the 20 most- 
significant SRBs (circle size). Loci are labelled by the likely driver gene from the 
CGC. For gene symbols separated by a solidus, both or either of the genes are 
intended. b, Rearrangement dispersion score versus mean replication timing 
of the 53 SRBs. Colours indicate fusion (purple), fragile-like (green), deletion 
(blue), amplification (red) or copy-neutral (black) events. c, Tumour-to-normal 
read coverage ratio in an ovarian tumour with a BRD4 microdeletion; red arrow 
indicates the rearrangement (top). Breakpoint density across PCAWG breast 
and ovarian cancers (middle). Enhancer locations from breast (BRCA) and 
ovarian (OV) tissue™ (bottom). d, Somatic copy number at the BRD4 and 
NOTCH3 locus in breast and ovarian cancers with (SV+) and without (SV-) 
rearrangements. e, Gene expression per absolute copy number for BRD4 and 
NOTCH3.f, The 30 most-significant SRJs, with their relative enrichment (circle 
size) per tumour type, annotated with oncogenic fusions from the Catalogue of 
Somatic Mutations in Cancer (COSMIC) (left), CGC gene (centre) and protein 
disruption (right) (Methods). A7PSE is also known as A7PS5FIE. g, Expression 


are associated with a lower expression of BRD4 in breast (P= 0.001) 
and ovarian tumours (P= 0.04), but not of the neighbouring gene 
NOTCH3 (Fig. 3e). The focal deletions in BRD4 overlap a prominent 


correlates of rearrangements in SRJs from COSMIC (purple), other SRJs (pink) 
ornotinany SRJ (grey). For each rearrangement (R), the primary locus (left) is 
defined as the breakpoint within 100 kb of the gene that is most overexpressed 
in rearranged samples; the secondary locus (right) is the other breakpoint. 
Expressionat the primary locus in samples with the rearrangement relative to 
samples without the rearrangement is greater for SRJs than for other 
rearrangements (left). The tissue-specific expression at the secondary locusin 
wild-type (WT) samples, relative to samples of different tissue types, is greater 
for SRJs than other rearrangements (right). Pvalues represent comparisons to 
‘notin SRJ’.d,e, g, Box plots show the interquartile range, median and 95% 
confidence interval; two-sided t-test. h, TERT promoter mutations and 
rearrangements across PCAWG melanomas. i, Rearrangements between 7ERT 
promoter and BASP1 and MYO10 locus result in focal amplification and 
relocation of distal enhancers to TERT. AML, acute myeloid leukaemia; 
Colorect, colorectal; Leiomyo, leiomyosarcoma; MPN, myeloproliferative 
neoplasm; Osteosarc, osteosarcoma; PiloAstro, pilocytic astrocytoma. 


exon-1 H3K4me3 peak and intron-1 enhancer elements in HMEC (nor- 
mal breast) and MCF-7 (breast tumour) cells (Extended Data Fig. 8e), 
which suggests that these deletions disrupt regulatory elements. 
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To our knowledge, this is the first evidence of a recurrent microdeletion 
limiting expression of an amplified gene. 


Recurrent fusions target gene regulation 


Motivated by the detection of fusion-like SRBs, we specifically looked 
for genomic loci that were juxtaposed more often than expected by 
chance, after controlling for both the rate of breakpoints at each locus 
and the distance between them (Methods). We identified 90 such SRJs 
(Fig. 3f, Supplementary Table 15), including 13 known oncogenic fusions 
(including all 8 fusion-like SRBs) and 77 novel hits—18 of which linked 
to at least one known cancer gene (Supplementary Note 8). Previously 
reported oncogenic SRJs were observed more frequently (average 
24 patients per fusion, range 2-98) than novel ones (most often 2 
patients per fusion, range 2-4). As juxtapositions are unlikely to occur 
by chance, observing even two becomes highly significant. However, it 
is possible that some SRJs reflect inaccuracies in our background model 
rather than true drivers. We therefore further evaluated the SRJs on the 
basis of (i) a‘robustness factor’ that indicates how much the background 
rate couldincrease before the SRJ would become insignificant, and (ii) 
the ratio between the observed and expected numbers of events under 
the current background model (‘effect size’) (Extended Data Fig. 9a). 
Twenty-six SRJs, including 11 of the 13 known drivers and 15 newly iden- 
tified SRJs, are robust to tripling the expected background rate, and 
22 others would remain significant with a doubled rate. 

Most canonical driver rearrangements have previously been found 
in single tumour types, often associated with tissue-specific expres- 
sion®*°, We found that 9 of our top 10 SRJs are tissue-specific, despite 
searching across 30 different types of tumour. Such tissue specificity 
is not observed for cancer genes affected by SCNAs, for which the top 
10 are altered in 11.9 cancer types (on average), or by point mutations 
(for which the top 10 are altered in 6.7 cancer types, on average) (Sup- 
plementary Table 16). 

The tissue specificity of SRJs suggests that they are strongly shaped 
by epigenetic state, either owing to mechanistic reasons (for example, 
tissue-specific three-dimensional proximity of the two DNA break- 
points) orto selection that connects tissue-specific regulatory elements 
with oncogenes?" **, The latter seems to be more likely because: (i) SRJs 
are associated with significant overexpression of only one of the rear- 
rangement partners (the ‘primary locus’) relative to randomly selected 
rearrangements (primary locus, P<10~“ (Fig. 3g left); secondary locus, 
P> 0.05 (Extended Data Fig. 9b left)); (ii) the rearrangement partner, 
inthe secondary locus, tends to be highly expressed in that tissue type 
relative to others (Fig. 3g right); and (iii) the distance to the nearest 
tissue-specific enhancer is smaller for SRJs than for rearrangements 
overall (Extended Data Fig. 9b). These observations suggest that SRJs 
act in general by bringing regulatory elements to an oncogene that is 
otherwise expressed at a low level. 

In many cases, SRJs generate truncated or chimeric proteins, and 
breakpoints within introns or exons were indeed overrepresented 
(68% versus 56% expected, P< 107’). However, only 11 of the 30 (37%) 
most significant SRJs generated novel proteins in all samples, and 6 
others sometimes generated novel proteins; the rest were either non- 
disruptive or contained breakpoints within the first two introns of the 
disrupted gene, leaving most of the protein intact** (Fig. 3f). Moreover, 
SRJs that generate novel proteins exhibited expression changes similar 
to those that do not (P=0.4) (Extended Data Fig. 9c). We conclude that 
altering gene expression is a key function of both classes of SRJs, and 
that SRJs are akin to non-coding driver point mutations that act on 
regulatory elements. 

We found several SRJs that involve amplified oncogenes, including 
MDM2, EGFR and TERT (Fig. 3f, h, i, Extended Data Fig. 9d-f, Supple- 
mentary Table 15). The TERT promoter region was juxtaposed in four 
melanomas (P< 10°’) to a region in the BASPI gene (both on chromo- 
some 5), and to aregion near NDUFC2 (t(5,11)) in two melanomas and 
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one medulloblastoma (P< 10%). Both juxtaposed regions were marked 
with melanocyte enhancers, which suggests that they could drive TERT 
expression. Among melanomas, these rearrangements are mutually 
exclusive with the C228T and C250T mutations of the TERT promoter 
(P<10°°) (Fig. 3h). Because the juxtapositions were always part of com- 
plex events that also amplified TERT, increased TERT expression may 
be due to amplification, the juxtapositions or both. 


Paucity of non-coding drivers in cancer 


Our analyses of genomic hotspots, functional elements, genomic 
windows and SRJs all suggest that non-coding drivers are rare com- 
pared to protein-coding drivers. This might, in part, be due to a lack 
of discovery power’. We therefore evaluated the discovery power of 
mutational-burden tests for recurrent events across the different types 
of element in our tumour cohorts, focusing first on point mutations*”*. 
We found that the fraction of mutated patients required for a driver 
to reach 90% discovery power ranged from <1% in large cohorts with 
low background-mutation densities to 25% in small cohorts with high 
background-mutation densities (Fig. 4a). Different types of element 
were similarly powered, suggesting that the paucity of drivers in non- 
coding versus coding elements is not due to alack of power. Similarly, 
our power to detect SRJs was higher in large cohorts with low rear- 
rangement rates, and for long and interchromosomal rearrangements 
owing to their lower overall rates (Extended Data Fig. 10a): we were only 
powered to detect events that recur in 5-20% of samples in most types 
of cancer (Fig. 4b). Moreover, beginning with about 2,500 tumours, 
we expect to find anew SRJ with every 25 additional genomes (Fig. 4c). 

Low sequencing coverage (for example, in GC-rich regions*’) also 
limits driver discovery. To measure this effect in the PCAWG data, we 
quantified our ability to detect mutations (detection sensitivity)”* in 
cancer gene promoters. Although the mean detection sensitivity in 
promoters is high (41.9% of genomic positions have mean detection 
sensitivity >80% across tumours), only 4.1% of the promoters had detec- 
tion sensitivity >90% in >90% of bases. In particular, the two canoni- 
cal TERT promoter hotspots had highly variable detection sensitivity 
among patients and cohorts, from only 3% of patients in the central- 
nervous-system pilocytic astrocytoma cohort to 100% in the thyroid 
adenocarcinoma cohort (Extended Data Fig. 10b). From these data, 
we inferred the expected number of TERT events in each tumour type 
(Extended Data Fig. 10c) and found that about 263 (95% confidence 
interval 232-295) TERT hotspot mutations were probably missed owing 
toalack of detection sensitivity. Moreover, on average 9.9% (1.3-13.0% 
interquartile range) of the cancer gene promoter territory inthe tumour 
of each patient was severely underpowered (an average detection sen- 
sitivity of <10%). Therefore, the lack of coverage in promoters may 
contribute to the paucity of non-coding drivers. 

To determine whether the paucity of non-coding drivers discov- 
ered thus far could be due to the limited statistical power of current 
datasets, we estimated the overall excess of point mutations above 
background (that is, the expected number of driver events) in coding 
and cis-regulatory non-coding sequences in 603 cancer genes* (Meth- 
ods, Supplementary Table 7, Supplementary Note 11). To minimize 
the effect of samples with low detection sensitivity, we included only 
936 samples with >90% detection sensitivity at the two TERT promoter 
hotspots (Extended Data Fig. 10c, d, Supplementary Note 11). Overall, 
this approach predicted more than 1,475 driver mutations (95% confi- 
dence interval 1,410-1,687; 1,069 SNVs and 406 indels) in the protein- 
coding sequences of these cancer genes (Fig. 4d), compared to only 
96 (95% confidence interval 30-190) estimated driver mutations in 
promoters (73 attributed to TERT), 22 (95% confidence interval 0-88) in 
S’UTRs, and 68 (95% confidence interval 0-178) in 3’ UTRs. Non-coding 
mutations in cancer-gene promoters were also not generally associ- 
ated with loss-of-heterozygosity or altered expression, as one would 
expect if they were enriched with drivers (Supplementary Note 12). 


40 | Abundance of powered (290%) driver 
<5% [5 - <10% [ll 10- <20% fy >20% 
Number of patients required to find driver 


Mutations 
per Mb 


i | 
cos HE 
Enhancer [zy Ei El 
aan 16 | 16 | 15 | 14 
37 UR BBE 


BEBE) 


i oes 
BEDE 


DV WV VoD DB) fd’ 0 Nod XV cB.S) DO DB) Vado 3) 
PPP MOS BAG GOP F BADAAR AP 
VS OO OP PP KL KS POI L CEL 
& Se cS ve oe & LLL S LESS SNA 
on g © 5&2 
FFF LEFF yh re © & 
SEE EE S Oy xf x SMX ore 
eo Pee Q 
PGE PK OF Vere 
¥ 
© 3 
b £ 2 Sao, £4 SE Q c 
@ SifOS0 SL oS £ 
2 2532935 825 5s 6 
xo) TOO§ ee 2H0 So oO 
& Bisypseyert f= 7 
d SE ETSE LES at : oa 
6 S23 2E62 E75 38 a 
@ 500 -—+—» Ir 
— E I, 1 t ll 1 miiaa 
oo a ee ae t ened | 2% | € 
oc ' Tita id i o 
~ © j ae i Lilie A tnt 1 Prevalence (3) F be 
2° Pitt | a | = 2 60, All juxtapositions 
no | ist * amis a 
gx 100-4 H ae 3 1 cee \ " f 5% @ = 
ES tx | $ PE ag) tah H os 
o } area 1 gg PR 0 5 6 ° 
Bo bt td 110% ss A 
5 g 50> tod ! aa a a H 2 % 301 'o OF |ow prevalance 
5 1 oa + 1 er ta * —> (rate < 12%) 
38 b ott ee 
e 5 1 TY H a a neo% 2 °. % High prevalance 
ss I aa hour cd 1 (rate => 12%) 
28 ft tt | I mee i a : 
10 i, —_— 0 5 
0 10 100 1,000 500 1,000 1,500 2,000 2,500 


Rearrangements per sample 


d promoter [F Zz 
5’ UTR } oz 
CDS 
3/ UTR E : 
0) 1,000 2,000 3,000 4,000 0 


Observed mutations 


Fig. 4 | Power considerations and paucity of non-coding drivers. a, Heat map 
shows the minimal frequency of a driver element with >90% discovery power. 
Power is dependent on the background mutation frequency (above the heat 
map), the element length (median length depicted in Extended Data Fig. 2c) 
and the number of patients with mutations (cell numbers). For example, the 
pan-cancer cohort is powered to discover a protein-coding driver gene (coding 
sequence (CDS)) present in <1% (18 patients), whereas the Bladder-TCC cohort 
is only powered to discover drivers present in at least 27% (6 patients). 

b, Number of samples required to detect 90% of recurrent juxtapositions 
across 90% of pairs of loci, as a function of the median number of 
rearrangements per sample and the rate above background at which the fusion 
recurs (solid lines). The vertical dashed lines represent the median 


These results collectively indicate that, independently of statistical 
power, non-coding cis-regulatory driver mutations in known cancer 
genes besides TERT are much less frequent than protein-coding drivers. 


Discussion 


The accurate and reliable discovery of genomic drivers intumours may 
have critical implications for patients with cancer. Our findings and 
the methods introduced here for the discovery of point-mutation and 
structural-variant drivers, method integration, vetting of candidates 
and identification of local hypermutation and fragile sites represent 
animportant contribution to the collective effort towards charting all 
malignant changes that drive the cancer of each patient?. 

Among the most interesting candidate non-coding driver elements 
we uncovered are the 5’-end mutations in 7P53; 3’ UTR mutations in 
NFKBIZ and TOB1I; and rearrangements involving AKRIC genes and 
BRD4. By careful analysis of the whole-genome sequencing data, we 
found that several previously reported and frequently altered non- 
coding elements may not be genuine drivers, including (i) the non- 
coding RNAs, NEAT1 and MALATI1 (which containa high density of indels, 


TERT promoter mutations 
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rearrangement rates of each cancer type, and the stars on these lines indicate 
the numbers of whole genomes analysed for that cancer type. c, Number of 
SRJs detected after downsampling the data to various sample sizes, separately 
indicating rearrangements that recur at high (=12%; red) and low (<12%; black) 
rates above background; their sum (blue). d, Number of observed mutations 
(SNVs and indels) in cis-regulatory and coding regions of 603 protein-coding 
cancer genes with the expected numbers shown in lighter colours (left). Right, 
the number of excess mutations (that is, the estimated number of driver 
mutations) (right). The grey fraction of promoter mutations indicates TERT 
events. Error bars show 95% binomial confidence intervals. Only samples with 
high detection sensitivity were included (n= 936). 


seemingly owing to atranscription-associated mutational process) and 
(ii) recurrent structural variants in regions of late replication, indicat- 
ing DNA fragility. 

This study yielded unexpectedly few non-coding driver point muta- 
tions and structural variants. SRJs, which appear to act largely through 
the rearrangement of regulatory elements, are less frequent than 
SCNA-like SRBs, which directly amplify or delete coding sequences. 
The results from five analyses-—hotspot recurrence, driver-element 
discovery, structural variants, discovery power and aggregated muta- 
tional excess—-suggest that this paucity is not caused by a particular 
analysis strategy, but that regulatory elements truly contribute amuch 
smaller number of recurrent cancer-driving events than protein-coding 
sequences. This paucity of non-coding drivers contrasts with the dis- 
tribution of germline polymorphisms associated with heritability of 
complex traits, which are most frequently located outside of protein- 
coding genes”. 

Atleast two factors contribute to the relative paucity of non-coding 
driver mutations in cancer: (i) the differential fitness effects of coding 
and non-coding mutations and (ii) the target size of functional ele- 
ments. The paucity of promoter driver mutations in well-established 
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cancer genes suggests that point mutations markedly affect the func- 
tion ofnon-coding regulatory elements only rarely. This highlights TERT 
as anotable exception, perhaps because even a modest increase in TERT 
expression may suffice to circumvent normal telomere shortening. 
For other cancer genes, directly mutating protein-coding sequences 
or altering expression levels by copy-number change may provide 
larger phenotypic effects. For example, complete loss-of-function 
by nonsense mutations or deletions may be easier to achieve than by 
disrupting or translocating regulatory regions. 

Technical shortcomings (such as coverage ‘blind spots’ in GC-rich 
promoters and different filtering strategies) may cause genuine driv- 
ers to be missed*’. Therefore, the discovery of non-coding drivers will 
benefit from technical improvements, including even sequence cover- 
age, longer and accurate reads, and improved variant-calling methods. 
Moreover, better annotation of functional non-coding elements will 
increase both the power to discover infrequently mutated driver ele- 
ments and their interpretability. As datasets grow, yet-unidentified 
mutational mechanisms targeting particular genomic regions will 
emerge and require improved background models, including additional 
covariates and more-sophisticated statistical models. The analysis of 
structural variants has greater challenges because (i) accurately model- 
ling their background density is complicated by their lower frequency 
and larger fraction of drivers (Supplementary Note 6); (ii) their target 
genes may be far from the breakpoints, as in SCNAs; (iii) the space for 
modelling SRJs is much larger (the genome squared); and (iv) many 
structural variants are part of complex events that often involve multi- 
ple chromosomes”, so that the resultant topology cannot be deduced 
without technologies such as long- or linked-read sequencing**”°. For 
these reasons, experimental validation remains important for all—and 
especially for non-coding—candidate drivers. 

Our work suggests that larger datasets and technological advances 
will continue toidentify new non-coding drivers, albeit at considerably 
lower frequencies than protein-coding drivers. We anticipate that the 
approaches developed here will provide a solid foundation for the 
incipient era of driver discovery from ever-larger numbers of cancer 
whole genomes. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 

Detailed methods are provided as Supplementary Methods. 


Dataset generation 

Out of 2,955 samples, we selected 2,583 unique donor samples for SNV 
and indel driver-discovery analysis on the basis of SNV quality control 
(Supplementary Methods). We found that 110 additional myeloid-AML 
samples had robust structural variant calls despite SNV artefacts; we 
included these in structural variant analyses, for a total of 2,693 sam- 
ples. For tumour-type cohort analyses, we used only cohorts with at 
least 20 patients. Tumour meta-cohorts were defined by cell type of 
origin or by organ system (for example, lung for lung adenocarcinoma 
and lung squamous cell carcinoma). A pan-cancer meta-cohort was 
created by combining all tumour cohorts except for Skin-Melanoma 
and lymphoid tumours (Supplementary Methods). 


Hotspot SNV analysis 

We selected the 50 most-frequent SNV hotspots. These were analysed 
to identify known driver events; mutational signature biases related 
to sequence palindromes, immunoglobulin loci and so on; and poten- 
tial artefacts, including regional mapping problems (Supplementary 
Methods). 


Mutational signatures 

We performed de novo global-signature discovery and signature 
attributions with SignatureAnalyzer’s Bayesian non-negative matrix 
factorization method”, based on 1,697 channels—including 1,536 
pentanucleotide sequence contexts for single-base substitutions, 83 
indel features, and 78 doublet-nucleotide substitution classes (Sup- 
plementary Methods). 


Definition of genomic elements 

GENCODE v.19 (ref.*°) and other genomic resources were used to define 
functional genomic elements, including protein-coding genes (CDS, 
splice sites, 5’ UTR, 3’ UTR and promoters), long non-coding RNAs (gene 
body, splice site and promoters), short RNAs, miRNAs and enhancers 
(Supplementary Methods). 


Candidate-driver-mutation identification methods and 
combination of results 

We obtained results (P values) from 13 methods of driver discovery, 
including ActiveDriverWGS*™, CompositeDriver, DriverPower®, 
dndscv*, ExInAtor®®, LARVA*’, MutSig tools?, NBR’, ncdDetect*’, 
ncDriver’, OncodriveFML® and regDriver™. We integrated the results 
of all these methods using a custom framework based ona previously 
published method® for combining P values. Results from individual 
methods that showed large deviations from the expected uniform null 
distribution of Pvalues were excluded. This approach was evaluated on 
real and simulated data. We controlled the FDR within each of the sets 
of tested genomic elements by concatenating all combined Brown’s 
P-values from across all tumour-type cohorts and applying the Ben- 
jamini-Hochberg procedure®. Cohort-element combinations with 
Qvalues < 0.1 were designated as significant hits, and combinations 
with 0.1< Q<0.25as ‘near significance’. Extensive details are provided 
in the Supplementary Methods. In addition, we tested for element- 
independent recurrence with the NBR method on 2-kb bins spanning 
the entire genome, and non-coding ultraconserved regions™. 


Post-filtering of driver mutation candidates 
We applied stringent filters to discern positive selection from tech- 
nical artefacts and mutational processes. We required at least three 


mutations to be present in candidate elements, in at least three patients 
of the tested cohort; more than 50% of mutations in mappable regions; 
less than 50% of mutations in palindromic DNA; and less than 50% of 
mutations attributed to APOBEC activity. For lymphoid tumours and 
skin melanoma, we required that <35% and <50% of mutations were 
attributed to the AID and UV-light mutational signatures, respectively. 
The FDR was recalculated after post-filtering. 


Candidate driver structural-variant analyses 

We applied separate analyses to detect recurrent structural variant 
breakpoints and recurrent juxtapositions. For each analysis, we first 
binned breakpoints, accepting only one breakpoint per sample per bin. 
We then determined which bins had more breakpoints than expected by 
chance (the SRB analysis), and which pairs of bins (or ‘tiles’) were joined 
by more rearrangements than expected by chance (the SRJ analysis). 


Candidate driver breakpoints 

We calculated the background rate of breakpoints per bin based on 
a Gamma-Poisson model® that took into account genomic covari- 
ates, breakpoint counts normalized by the number of bases within 
each bin that had sufficient mappability to be eligible for breakpoint 
detection and accounted for an observed overdispersion of breakpoint 
counts that probably reflects unaccounted-for covariates (Supple- 
mentary Methods). We used the Gamma-Poisson model to calculate 
the P value for each bin (that is, the probability that each bin would 
exhibit the observed number of breakpoints (or greater) by chance 
alone), applying the Benjamini-Hochberg procedure® to correct for 
multiple hypotheses. 


Post-filtering of driver breakpoint candidates 

We scored each recurrent breakpoint locus on the basis of the average 
replication timing of its breakpoints, and filtered those loci with scores 
>0.5 as probable fragile sites®. 


Candidate driver juxtapositions 

We developed a background model to indicate the probability that two 
loci would be joined, taking into account the observed rate at which 
each locus underwent DNA breaks (from the breakpoint analysis), the 
distance between them and the propensity for these rearrangements 
to reflect a break followed by invasion versus two breaks that were then 
joined. We determined the probability that each tile would contain the 
observed number of rearrangements using a binomial test, followed 
by controlling for multiple hypothesis testing using the Benjamini- 
Hochberg procedure®™. 


Gene-expression analyses 

Gene-expression data were provided by the PCAWG Transcriptome 
Core Group“, and also generated using the same approach for an 
extended set of non-coding transcripts (Supplementary Methods). 


Additional evidence for selection 

In addition to associations between mutations or structural variants 
and expression, we looked for signals of copy-number-alteration recur- 
rence using the GISTIC2 algorithm”. We also tested whether driver can- 
didates showed significantly higher frequency of loss-of-heterozygosity 
in mutated samples using Fisher’s exact test. We calculated cancer 
allelic fractions using ploidy and tumour purity predictions froma 
previous publication®. 


Mutational process and indel enrichment 

For every gene, we calculated the proportion of indels of length 2-5 
bp out of the total number of indels. This proportion was compared 
to the genome background proportion using a binomial test. We also 
compared the indel rate per gene (not distinguishing by length) tothe 
background. Both sets of Pvalues were corrected with the FDR method. 
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Power calculations 

We estimated our power to discover driver elements mutated at a par- 
ticular frequency in the population as previously described*"’, but 
solving for the lowest frequency for a driver element in the patient 
population that is powered (>90%) for discovery. The calculation of this 
lowest frequency takes into account (i) the average background muta- 
tion frequencies for each cohort-element combination; (ii) the median 
length and average detection sensitivity for each element type and 
patient cohort size; and (iii) a global desired false-positive rate of 10%. 
The effect of element length is discussed in Supplementary Note 10, and 
details are provided in Supplementary Methods. Power calculations for 
detection of recurrent juxtapositions was performed similarly, except 
over atwo-dimensional genomic fusion map divided into 100 x 100-kb 
tiles (Supplementary Methods). We performed this analysis first as a 
function of the distance between breakpoints (Extended Data Fig. 10a) 
and second asa function of the median number of rearrangements per 
sample, spanning values represented by histologies with more than15 
samples (Fig. 4b). 


Estimation of the number of mutations in non-coding regions of 
known cancer genes 

NBR was used to estimate the background mutation rate expected 
across cancer genes, using a conservative list of 19,082 putative passen- 
ger genes as background and including as covariates the local mutation 
rate, gene expression and averaged copy-number states. The resulting 
model predicted the number of passenger SNVs and indels expected 
by chance. By aggregating the expected numbers over 603 known 
cancer genes from the CGC® (CGC v.80) (Supplementary Table 7), we 
compared the observed and expected numbers of mutations. For this 
analysis, we excluded samples with problems of low detection sensitiv- 
ity (Supplementary Methods). 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Data associated with this Article are available at https://dcc.icgc.org/ 
releases/PCAWG/drivers. SRBs and SRJs are available at www.svscape. 
org. A list of data files used for analyses in this paper is provided in Sup- 
plementary Table 20. Somatic and germline variant calls, mutational 
signatures, subclonal reconstructions, transcript abundance, splice 
calls and other core data generated by the ICGC and TCGA PCAWG 
Consortium are described in an accompanying Article’®, and are avail- 
able for download at https://dcc.icgc.org/releases/PCAWG. Additional 
information on accessing the data, including raw read files, can be 
found at https://docs.icgc.org/pcawg/data/. In accordance with the 
data access policies of the ICGC and TCGA projects, most molecular, 
clinical and specimen data are in an open tier that does not require 
access approval. To access information that could potentially identify 
participants, such as germline alleles and the underlying sequencing 
data, researchers will need to apply to the TCGA data access committee 
via dbGaP (https://dbgap.ncbi.nIm.nih.gov/aa/wga.cgi?page=login) 
for access to the TCGA portion of the dataset, and to the ICGC data 
access compliance office (http://icgc.org/daco) for the ICGC portion 
of the dataset. In addition, to access somatic single-nucleotide vari- 
ants derived from TCGA donors, researchers will also need to obtain 
dbGaP authorization. 


Code availability 


The core computational pipelines used by the PCAWG Consortium for 
alignment, quality control and variant calling are available to the public 


at https://dockstore.org/search?search=pcawg under the GNU General 
Public License v.3.0, which allows for reuse and distribution. Code for 
Pvalue combination from multiple driver methods is available from 
https://github.com/broadinstitute/getzlab-PCAWG-pvalue_combina- 
tion/. Power calculation methods are available from https://github. 
com/broadinstitute/getzlab-PCAWG-power_calculations. Structural 
variant methods are located at https://github.com/mskilab/fishHook, 
https://github.com/walaj/ginseng and https://github.com/walaj/SVsig. 
Links to individual driver discovery methods are provided in the cor- 
responding section of the Supplementary Methods. 
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site hotspots across all analysed individual cohorts and meta-cohorts. 
Hotspots are grouped as known drivers or induced by mutational processes. 


alterations for protein-coding genes. 
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Extended Data Fig. 2| See next page for caption. 
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Extended Data Fig. 2| Element-based driver discovery and combination of 
Pvalues.a, Schematic describing definition of types of functional element 
(Methods). Functional elements (black) are defined on the basis of transcript 
annotations from various databases. Elements arising from multiple 
transcripts with the same gene identity are collapsed, as seen here for the 
protein-coding isoforms. Promoter elements are defined as 200 bases 
upstream and downstream of the transcription start sites of the transcripts ofa 
gene (green). Splice site elements extend 6 and 20 bases from the 3’ and 5’ 
exonic ends into intronic regions, respectively (light blue). Regions 
overlapping protein-coding bases and protein-coding splice sites are 
subtracted from other regions. b, Percentage of genomic coverage for each 
element type. c, Distribution of element lengths for each element type. Thick 
lines indicate interquartile ranges and short horizontal bars indicate the 
medians. d, Organization of meta-cohorts defined by tissue of origin and organ 
system. Pan-cancer contains all cancers, excluding Skin-Melanoma and 


lymphoid malignancies. e, Combination workflow: overview of methods of 
driver discovery and their lines of evidence to evaluate candidate gene drivers. 
Methods using each feature are marked with a box in the appropriate track. 
Heat map displaying Spearman’s correlation of Pvalues across the different 
driver-discovery algorithms based on simulated (null model) mutational data. 
Dendrogram illustrates the relatedness of method Pvalues, and algorithm 
approaches are marked by coloured boxes on dendrogram leaves. Next, 
Pvalues are combined with Brown’s method on the basis of the calculated 
correlation structure. Individual method (left) and integrated (right) log- 
transformed P values are shown ina heat map (grey, missing data). Post- 
filtering used several criteria to identify likely suspicious candidates. 
Significant driver candidates were identified after controlling for multiple 
hypothesis testing based on an FDR Q value threshold of 0.1 (blue asterisk). 
Candidates with Q values below 0.25 (blue dash) were also considered of 
interest. 
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Extended Data Fig. 3 | Sensitivity of driver-discovery methods and filter 
Statistics. a, Percentage of coding-driver discovery runs (with stable F, score, 
n=33), across all cohorts, in which the method had the highest F, score 
(Methods). b, F, score of different methods of driver discovery, and different 
combinations evaluated in the four largest cohorts (pan-cancer (n=2,278), 
carcinoma (n=1,856), adenocarcinoma (n=1,631) and digestive tract (n=797)). 
Only methods that used the same algorithm to call coding and non-coding 
drivers were evaluated. Vertical lines indicate 95% confidence intervals. 
Horizontal black lines mark the median in each group. Pvalues were calculated 
with the two-sided non-parametric Mann-Whitney Utest.c, Ontop, the initial 
number of hits identified as recurrently mutated for each element type. The 
element types mature miRNA (n=2 before filtering) and miRNA promoters 
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(n=16 before filtering) were omitted from the table. The heat map shows the 
number of hits filtered at each step in the sequential application of filters and 
post-filtering re-application of the FDR correction. Background colours 
indicate the corresponding percentage of input element removed. The final 
numbers of hits (including those that were later filtered by the comprehensive 
vetting procedures) are indicated below the heat map. d, Sensitivity versus 
specificity in individual cohorts versus meta-cohorts for candidate drivers: 
Qvalues for the most significant individual cohort (x axis) versus meta cohort 
(yaxis) are shown. Driver elements are coloured by their element type. Q values 
derived from combination of Pvalues from individual driver-discovery 
methods (Methods). 
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Extended Data Fig. 4 | Mutation-to-expression correlation and focal copy- For TERT, only samples powered to call mutation status were used. Pvalues are 
number alterations. a, Expression is compared between mutated and non- based onatwo-sided Wilcoxon rank-sum test. Bars indicate means. b, Copy- 
mutated samples. For each element, the zscore of the expression values for number profiles of 55 of 441 stomach adenocarcinomas from TCGA show copy- 
mutated and wild type in the significant cohort is plotted. For copy number, number gains around HESI. TOB1 and its gene neighbour WFIKKN2are focally 
CNA amplification indicates CNA >10; CNA gain indicates CNA 23; CNA loss amplified in cancer (172 of 10,844 total samples from 33 cancer types are 
indicates CNA <1; and noevents indicates CNA <3 and CNA > 1. Ifa patientis shown). RMRP focal amplifications in TCGA cancers (160 of 10,844 total 


mutated with multiple types of point mutation, indels are indicated over SNVs. tumours shown). 
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Extended Data Fig. 5 | Non-coding driver candidates. a, M7G2 promoter locus 
(left) and associated gene-expression changes in carcinoma tumours (right). 
Expression of M7G2in mutated (n= 3) versus the carcinoma meta-cohort wild- 
type cases (n= 896). Two-sided Wilcoxon rank-sum test. Bars represent means. 
b, Genomic locus of NFKB/Z3’ UTR (left) and associated gene-expression 
changes in Lymph-BNHL (right). Expression of NFKB/Z in mutated (n= 6) versus 
wild-type cases (n= 98). Test and bars as inb. c, Genomic locus of the RMRP 
transcript and promoter region (left). RMRP is an RNA component of the 
endoribonuclease RNase MRP, the function of which depends onits RNA 


secondary and tertiary structure. The RNA secondary structure, tertiary 
structure interactions, protein and substrate interactions, and mutations with 
their predicted structural effect (right) of RMRP;lymphomaand melanoma 
mutations are excluded. d, M/R142 locus and mutations in patients with 
lymphoma with the AID signature annotation. e, Manhattan-style plot showing 
significance of mutation recurrence enrichment for genomic bins (top) and 
ultraconserved elements (bottom) across cohorts (Methods; Supplementary 
Table 9). 
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Extended Data Fig. 6 | Atranscriptional process creates passenger 
mutations in highly expressed, tissue-specific genes. a, Relative rate of loss- 
of-heterozygosity (LOH) compared between mutated and wild-type samples 
for all significant elements, coloured by element type and highlighting 
significant LOH enrichments with an outside black circle (Fisher’s exact test, 
one-sided; Q< 0.1). b, Average cancer allelic fraction (CAF) compared between 
each significant genomic element and the corresponding flanking regions 
(+2 kb and introns; overlapping coding exons were excluded). The size of the 
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points represents the number of mutated samples for each particular element. 
Genes with significantly higher CAFs (t-test, one-sided; Q< 0.1) are highlighted 
with an outside black circle. c, mRNA expression of genes enriched in 2-5-bp 
indels in their respective tissues. Boxes show the interquartile range and 
median. The first box contains background gene-expression levels. Red and 
grey dots correspond to samples with (m) and without (n- m) indels in the 
corresponding gene. d, Heat map showing the levels of expression across 
types of cancer for the genes enriched in 2-5-bp indels. 
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junctions inthree hypothetical genomes (top) andthetwoanalysisapproaches —_across2,693 cancer genomes (Methods). 
(bottom): the 1D analysis for recurrent breakpoints and the 2D analysis for 
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Extended Data Fig. 8 |Gene-expression effects of SRBs. a, Fraction of 
recurrent breakpoint loci associated with biallelic inactivation of aknown 
tumour suppressor gene (frag-SCNA, 0/12; neutral-SCNA, 0/14; del-SCNA, 5/8; 
Fisher’s exact test). b, Distance in bp to the nearest tissue-specific enhancer for 
each breakpoint class. Dashed grey line represents 1,000 randomly selected 
breakpoints fromthe same tumour samples. All box plots show the 
interquartile range, median and 95% confidence interval. c, Expression fold 
change for the gene with the most-altered expression within 1 Mb of the cluster 
centroid insamples with, compared to samples without, a breakpoint at the 
cluster locus. Random controls (in dashed boxes) represent 1,000 randomly 
selected breakpoints. Pvalues are from two-sided t-tests (Methods). 

d, Breakpoint density near AKRIC genes (top), locations of enhancers (middle) 
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and expression of local genes (bottom; n=7 SV+ tumours, n=41SV-lung 
squamous cell tumours; two-sided ¢-test) in samples with and without local 
rearrangements. e, Ratio of tumour-to-normal read coverage across six breast 
tumours and eight ovarian tumours with focal BRD4 exon 1andintron1 
deletions. Red lines indicate rearrangements. f, Amplification structure 
(absolute copy number, y axis) of the BRD4 and NOTCH3 locus in breast and 
ovarian tumours with a BRD4 focal deletion. In most cases, the copy-number 
caller identified the focal deletion. However, in some cases, the deletions were 
toosmall to be identified only using read depth. When combining read depth 
and rearrangement signals ina, there is clear evidence for focal deletions. 
Deletion locations are marked by an asterisk. 
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Extended Data Fig. 9| See next page for caption. 
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Extended Data Fig. 9 |Gene-expression effects of SRJs. a, Assessment of SRJ 
robustness against unaccounted for mechanistic and technical confounders. 
Left, arobustness factor, defined as the ratio between the background 
probability value that would lower the P value of an SRJ below the genome-wide 
P-value threshold and the estimator for the background probability from our 
2D model. Higher robustness values represent lower susceptibility to 
unaccounted variations inthe background model. The top 48 SRJs havea 
robustness factor greater than 2, which suggests that these SRJs would remain 
significant even if the true background rate was twice as high as our model 
estimates. Right, the effect size is calculated as the difference in observed and 
estimated number of SRJsin units of standard deviation (assuming binomial 
distribution of structural variant count per 2D genomic region). Most SRJs are 
well above ten standard deviations of the predicted value. b, Characteristics of 
SRJ secondary loci. Left, fold expression enrichment of the most highly 
overexpressed gene in the secondary locus in cancer samples with these 


fusions relative to cancers of the same histology without the fusion. Right, the 
distance from the SRJ secondary locus (green) to the nearest enhancer is 
significantly smaller (P< 0.05; two-sided t-test) compared to randomly 
selected breakpoints (grey). c, Fold expression enrichment of the most highly 
overexpressed gene in the primary locus, for fusions that disrupt protein- 
coding sequences and fusions that do not. All box plots show the interquartile 
range, median and 95% confidence interval. d, Rearrangements between the 
TERT promoter and the BASP1 and MYO10 locus result in focal amplification of 
TERT and relocation of distal enhancers to TERT. e, TERT-NDUFC2 fusion in two 
melanoma samples connecting 7ERT with an enhancer-rich region next to 
NDUFC2. Both samples also have focal amplifications of TERT. f, Recurrent 
translocation between EGFR in chromosome 7 and the KL and STARDI3 locus on 
chromosome 13. Inall three samples, the rearrangement contributed to the 
amplification of EGFR. 
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Extended Data Fig. 10 | Alack of detection power in specific elements. 

a, Number of tumour-normal pairs needed to detect fusions with 90% power as 
a function of the span of the fusion and the rate above background at which it 
recurs. The red asterisks indicate the numbers of samples required to detect 
100-kb and100-Mb fusions that recur at 0.5% above their background rates. 

b, Distribution of TERT promoter hotspot (top, chromosome 5:1,295,228; 
bottom, chromosome 5: 1,295,250; hg19) detection sensitivity for each patient, 
by cohort. Grey dots indicate values for individual patients inside estimated 
distribution (areas coloured by cohort). Horizontal black bars mark the 
medians. Numbers above distributions indicate the percentage of patients 
powered (detection sensitivity > 90%) in each cohort. Cohort sizes as in Fig. 4a. 
c, Percentage of patients with observed (blue) and inferred missed (red) 
mutations at the chromosome 5:1,295,228 and chromosome 5:1,295,250 TERT 
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promoter hotspot sites. Error bars indicate 95% Poisson confidence interval. 
Numbers above bars show the total inferred number of TERT promoter 
mutations for each site in this cohort. Red numbers indicate the absolute 
number of inferred missed mutations (owing toa lack of read coverage). 
Cohort sizes as in Fig. 4a.d, Detection sensitivity for the two TERT promoter 
hotspots across all samples showing the variation in powered samples. Red 
vertical line (x=0.9) indicates cutoff for ‘sufficiently powered samples’. 

e, Mean detection sensitivity in 1,000 randomly selected putative passengers 
(pass) and 603 cancer genes (driv) across element types: promoters, 5’ UTRs, 
CDS and 3’ UTRs. The left panel shows the results for all samples and the right 
panel corresponds tothe set of samples with high sensitivity at ERT hotspots. 
Boxes show the interquartile range and median; outliers are shownas circles. 
Weighted sensitivity means are shownat the top of the box plot. 
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Extended Data Fig. 11| See next page for caption. 


-log) observed P value 


Sanger simulation 


ActiveDriverWGS (\=0.38) 
DriverPower (\=0.0) 
ExinAtor (\=0.59) * 
MutSig (\=0.27) 
NBR (\=0.43) 
ANdScv (\=0,5) oe 
nedDetect (\=0.76) 
oncodriveFML_cadd (\=0.71) 
oncodriveFML_vest3 (\=0.71) 


1 2 3 
-logip observed P value 


Simulated Difference 


-1-05 0 05 


Spearman 
correlation 


Broad simulation 


DKFZ simulation 


Sanger simulation 


1 


Extended Data Fig. 11| Pvalue combination details. a, Quantile-quantile 
plots of Pvalues reported by various driver-detection algorithms on the three 
simulated datasets (Broad, DKFZ and Sanger; shown for coding regions 
(n=20,172) inthe meta-carcinoma cohort; see Methods for details for the 
statistical background modelor test of each algorithm) showed no major 
enrichment of mutations above the background rate. Results generally 
followed the expected null (uniform) distribution, and the Pvalues reported on 
simulated data were subsequently used to assess the covariance of method 
results. b, Quantile-quantile plots of integrated P values using the Brown and 
Fisher methods for combining P values across the results from different driver- 
detection algorithms were generated for a few representative tumour cohorts 
(shown here for coding regions). Brown combined P values (light blue) 
generally followed the null distribution as expected, whereas Fisher combined 
Pvalues were significantly inflated (dark blue), confirming that dependencies 
existed between the results reported by the various driver-detection 
algorithms. To simplify the integration procedure, we calculated covariances 
using P values from the observed data instead of simulated data and found that 


the integrated results based on the observed covariances (first column of 
plots) were essentially the same as the results obtained using the simulated 
covariances (second, third, and fourth columns of plots). c, Triangular heat 
maps showing the Spearman correlations of Pvalues among the various driver- 
detection methods in observed versus simulated data (coding regions 
(n=20,172), colorectal adenocarcinoma cohort) are highly similar. Differences 
inthe observed and simulated correlation values (shown in the heat maps on 
the far right) were minimal, and thus the final integration of Pvalues across 
methods was performed using covariances estimated on observed data. 

d, Brown combined P values based on observed and simulated covariance 
estimations (shown on the right, top heat map, for coding regions in 
glioblastoma) did not differ noticeably. In cases in which individual methods 
reported results that yielded substantially fewer hits than the median across all 
methods (bottom heat map, methods in light grey with results in dashed box), 
removing the methods from the integration did not affect the number of 
significant genes identified (right column of results in bottom heat map, shown 
for coding regions in lung adenocarcinoma). Number of coding regions asinc. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 
n/a | Confirmed 

| x The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 

[| xX A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


‘i ms The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


[_||X] A description of all covariates tested 
| Xl A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


O x A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


O x For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


| xX For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 
[| For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


| ]|DX] Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Data and metadata were collected from International Cancer Genome Consortium (ICGC) consortium members using custom software 
packages designed by the ICGC Data Coordinating Centre. The general-purpose core libraries and utilities underlying this software have 
been released under the GPLv3 open source license as the "Overture" package and are available at https://www.overture.bio. Other data 
collection software used in this effort, such as |CGC-specific portal user interfaces, are available upon request to contact@overture.bio. 


Data analysis The workflows executing core WGS alignment, QC and variant-calling software are packaged as executable Dockstore images and 
available at: https://dockstore.org/search ?labels.value.keyword=pcawg&searchMode=files. Individual software components are as 
follows: BWA-MEM v0.78.8-r455; DELLY v0.6.6; ACEseq v1.0.189; DKFZ somatic SNV workflow v1.0.132-1; Platypus v0.7.4; ascatNgs 
v1.5.2; BRASS v4.012; grass v1.1.6; CaVEMan v1.50; Pindel v1.5.7; ABSOLUTE/JaBbA v1.5; SvABA 2015-05-20; dRanger 2016-03-13; 
BreakPointer 2015-12-22; MuTect v1.1.4; MuSE v1.0rc; SMuFIN 2014-10-26; OxoG 2016-4-28; VAGrENT v2.1.2; ANNOVAR v2014Nov12; 
VariantBAM v2017Dec12; SNV-Merge v2017May26; SV-MERGE v2017Dec12; DKFZ v2016Dec15 


The method for combining p-values is available from https://github.com/broadinstitute/getzlab-PCAWG-pvalue_combination/. Power 
calculations are available from https://github.com/broadinstitute/getzlab-PCAWG-power_calculations. The method for identifying 
significantly recurrent breakpoints by controlling for covariates is available at: https://github.com/mskilab/fish.hook. The method for 
permuting rearrangement breakpoint pairs to identify covariates affecting rearrangement partner selection is available as the "swap" 
module of: https://github.com/walaj/ginseng. The method for identify significantly recurrent rearrangements is available as the "2D" 
method at: https://github.com/ofershapira/SVsig. The method for identifying cis-expression consequences of SVs, CESAM, is available at: 
https://bitbucket.org/weischen/cesam 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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Data 


Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


WGS somatic and germline variant calls, mutational signatures, subclonal reconstructions, transcript abundance, splice calls and other core data generated by the 
ICGC/TCGA Pan-cancer Analysis of Whole Genomes Consortium are available for download at https://dcc.icgc.org/releases/PCAWG. Additional information on 
accessing the data, including raw read files, can be found at https://docs.icgc.org/pcawg/data/. In accordance with the data access policies of the ICGC and TCGA 
projects, most molecular, clinical and specimen data are in an open tier which does not require access approval. To access potentially identification information, 
such as germline alleles and underlying sequencing data, researchers will need to apply to the TCGA Data Access Committee (DAC) via dbGaP (https:// 
dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?page=login) for access to the TCGA portion of the dataset, and to the ICGC Data Access Compliance Office (DACO; http:// 
icgc.org/daco) for the ICGC portion. In addition, to access somatic single nucleotide variants derived from TCGA donors, researchers will also need to obtain dbGaP 
authorization. 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size We compiled an inventory of matched tumour/normal whole cancer genomes in the ICGC Data Coordinating Centre. Most samples came 
from treatment-naive, primary cancers, but there were a small number of donors with multiple samples of primary, metastatic and/or 
recurrent tumours. Our inclusion criteria were: (i) matched tumour and normal specimen pair; (ii) a minimal set of clinical fields; and (iii) 
characterisation of tumour and normal whole genomes using Illumina HiSeq paired-end sequencing reads. 
For analyses specific to this study, we generated sample subsets as described in the methods. 


Data exclusions After quality assurance, data from 176 donors were excluded as unusable. Reasons for data exclusions included inadequate coverage, 
extreme bias in coverage across the genome, evidence for contamination in samples and excessive sequencing errors (for example, through 8- 
oxoguanine). 


Replication In order to evaluate the performance of each of the mutation-calling pipelines and determine an integration strategy, we performed a large- 
scale deep sequencing validation experiment. We selected a pilot set of 63 representative tumour/normal pairs, on which we ran the three 
core pipelines, together with a set of 10 additional somatic variant-calling pipelines contributed by members of the SNV Calling Working 
Group. Overall, the sensitivity and precision of the consensus somatic variant calls were 95% (CI90%: 88-98%) and 95% (CI90%: 71-99%) 
respectively for SNVs. For somatic indels, sensitivity and precision were 60% (34-72%) and 91% (73-96%) respectively. Regarding SVs, we 
estimate the sensitivity of the merging algorithm to be 90% for true calls generated by any one caller; precision was estimated as 97.5% - that 
is, 97.5% of SVs in the merged SV call-set have an associated copy number change or balanced partner rearrangement. 


Randomization No randomisation was performed. 


Blinding No blinding was undertaken. 
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Human research participants 


Policy information about studies involving human research participants 


Population characteristics Patient-by-patient clinical data are provided in the marker paper for the PCAWG consortium (Extended Data Table 1 of that 
manuscript). Demographically, the cohort included 1,469 males (55%) and 1,189 females (45%), with a mean age of 56 years 
range, 1-90 years). Using population ancestry-differentiated single nucleotide polymorphisms (SNPs), the ancestry distribution 
was heavily weighted towards donors of European descent (77% of total) followed by East Asians (16%), as expected for large 
contributions from European, North American and Australian projects. We consolidated histopathology descriptions of the 
umour samples, using the ICD-0-3 tumour site controlled vocabulary. Overall, the PCAWG data set comprises 38 distinct tumour 
ypes. While the most common tumour types are included in the dataset, their distribution does not match the relative 
population incidences, largely due to differences among contributing ICGC/TCGA groups in numbers sequenced. 


Recruitment Patients were recruited by the participating centres following local protocols. 


Ethics oversight The Ethics oversight for the PCAWG protocol was undertaken by the TCGA Program Office and the Ethics and Governance 
Committee of the ICGC. Each individual ICGC and TCGA project that contributed data to PCAWG had their own local 
arrangements for ethics oversight and regulatory alignment. 


=] 
ra¥) 
i 
(S 
(a>) 
= 
OD 
Ua) 
O 
ra) 
= 
ia) 
=i 
o 
1S) 
(S) 
Br 
= 
(Co) 
n 
(cS 
S 
S| 
ra) 
> 
< 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Article 


Patterns of somatic structural variation in 
human cancer genomes 


https://doi.org/10.1038/s41586-019-1913-9 


Received: 22 September 2017 


Accepted: 18 November 2019 


Published online: 5 February 2020 


Open access 


Yilong Li’?“, Nicola D. Roberts", Jeremiah A. Wala***™“, Ofer Shapira®*°", 

Steven E. Schumacher*“*, Kiran Kumar®*°*, Ekta Khurana‘, Sebastian Waszak’, Jan O. Korbel’, 
James E. Haber®, Marcin Imielinski®, PCAWG Structural Variation Working Group”, 

Joachim Weischenfeldt", Rameen Beroukhim****, Peter J. Campbell"’* & 

PCAWG Consortium" 


Akey mutational process in cancer is structural variation, in which rearrangements 
delete, amplify or reorder genomic segments that range in size from kilobases to 
whole chromosomes’ ”. Here we develop methods to group, classify and describe 
somatic structural variants, using data from the Pan-Cancer Analysis of Whole 
Genomes (PCAWG) Consortium of the International Cancer Genome Consortium 
(ICGC) and The Cancer Genome Atlas (TCGA), which aggregated whole-genome 
sequencing data from 2,658 cancers across 38 tumour types®. Sixteen signatures of 
structural variation emerged. Deletions have a multimodal size distribution, assort 
unevenly across tumour types and patients, are enriched in late-replicating regions 
and correlate with inversions. Tandem duplications also have a multimodal size 
distribution, but are enriched in early-replicating regions—as are unbalanced 
translocations. Replication-based mechanisms of rearrangement generate varied 
chromosomal structures with low-level copy-number gains and frequent inverted 
rearrangements. One prominent structure consists of 2-7 templates copied from 
distinct regions of the genome strung together within one locus. Such cycles of 
templated insertions correlate with tandem duplications, and—in liver cancer— 
frequently activate the telomerase gene TERT. A wide variety of rearrangement 
processes are active in cancer, which generate complex configurations of the genome 
upon which selection can act. 


Mutations that arise in somatic cells are the driving force of cancer 
development. Structural variation—in which genomic rearrangement 
acts to amplify, delete or reorder chromosomal material at scales that 
range from single genes to entire chromosomes-is an especially impor- 
tant class of somatic mutation. Previous analyses of both cancer and 
germline genomes have enabled the description of several distinctive 
patterns of structural variants’ ’, and hypotheses about the underlying 
basis of several of these patterns have been proposed on the basis of 
their clustering, orientation and associated copy-number changes. 
Hypothesis-driven in vitro studies are now beginning to reveal some 
of the mechanistic processes that generate these structures” °, and 
generate further predictions that can be assessed in the genomic data. 
However, the landscape of structural variation in human cancer remains 
incompletely mapped and there are many complex structures that 
elude formal description. 

The PCAWG Consortium aggregated whole-genome sequencing 
data from 2,658 cancers across 38 tumour types, generated by the ICGC 


and TCGA projects. These sequencing data were aligned to the human 
genome (reference build hs37d5) and analysed with standardized, high- 
accuracy pipelines to call somatic and germline variants of all classes®. 
Here, we analyse the patterns and signatures of structural variants 
across the PCAWG data. We propose a working classification scheme 
that encompasses known and newly identified classes of structural vari- 
ants. We develop methods for annotating the observed structural vari- 
ants ina given cancer genome, identifying a class of replication-based 
rearrangement processes that generate clusters of several structural 
variants. We explore the size, activity and genome-wide distribution of 
classifiable structural variant types across the cohort, using signature 
analysis to define how they correlate within patients. Other papers 
produced by PCAWG address complementary aspects of structural 
variants, including inference of positive selection acting on recurrently 
rearranged regions of the genome”, howstructural variants affect the 
transcriptome’ and chromosome topology”, patterns of somatic retro- 
transposition” and distribution of chromothripsis across cancer types”. 
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Fig. 1| Classification of structural variants in cancer genomes. Schematics of 
major structural-variant (SV) classes, grouped according to whether they are 
simple or complex and arise through cut-and-paste or copy-and-paste 
processes. Each schematic comprises three parts. The top segment shows 
dotted arcs for each rearrangementjunction that joins two chromosomal 
segments together. The middle segment shows the copy number of genomic 
segments that are involved. The bottom segment shows the configuration of 
the final derivative chromosome that results from the structural variant; the 
colour of the segments corresponds to the colour of that segment inthe copy- 
number schematic. + indicates the different derivative chromosomes created 
for some of the classes: that is, the structural variants are not phased toa single 
derivative. 


Classification of structural variants 


A‘structural variant’ manifests as a ‘junction’ between two ‘breakpoints’ 
inthe genome (terms in inverted commas here and below refer to those 
defined in the glossary in Extended Data Table 1). Generally, there will 
bea change in copy number across a given breakpoint if only one side 
of the break is rescued by a structural variant; if both sides of a double- 
stranded DNA break are rescued, a ‘reciprocal’ or ‘balanced’ struc- 
tural variant will result, without substantial copy-number change. We 
sometimes observe ‘clusters of structural variants’ in which several 
breakpoints occur close together, in time or in genomic space—usually 
both. Such spatial and/or temporal proximity generally, but not always, 
implies that the structural variants within a cluster are mechanistically 
linked. Clusters can be ‘phased’ (in which case all structural variants in 
the cluster resolve to a single derivative chromosome) or ‘unphased’, 
inwhich case the structural variants are carried on different derivative 
chromosomes. An example of the latter is a reciprocal translocation 
that results in two derivative chromosomes, each with a single inter- 
chromosomal breakpoint junction (Fig. 1). 


We recognize distinct ‘classes of structural variant’ from the orienta- 
tion of the two segments at the junction and associated copy-number 
changes (Fig. 1, Supplementary Fig. 1). Some classes of structural vari- 
ant (such as isochromosomes and rearrangements between extended, 
highly homologous sequences) are difficult to detect with short-read 
sequencing data; these classes are not considered further here. We 
propose categorizing classes of structural variant across two facets: the 
number of breakpoints involved (simple or complex) and by whether 
the patterns are likely to arise from ‘cut-and-paste’ or ‘copy-and-paste’ 
rearrangement processes. A cut-and-paste process generates a clus- 
ter of structural variants consistent with reshuffling or loss of extant 
genomic segments, anda copy-and-paste process is one in which cop- 
ies of genomic ‘templates’ are newly replicated or synthesized and 
inserted during the rearrangement process. Deletions, reciprocal 
inversions, unbalanced translocations and reciprocal translocations 
are examples of simple cut-and-paste structural variants, as they can 
be reconstructed from the incorrect religation of chromosomal breaks. 
Tandem duplications are simple copy-and-paste structural variants, as 
they arise through the local insertion of a newly generated extra copy 
of agenomic template. 

More-complex cut-and-paste processes that produce structural 
variants also occur in cancer. ‘Breakage-fusion-bridge’ events result 
from cycles of DNA breakage, end-to-end sister chromatid fusions, 
mitotic bridges and further DNA breakage. These events manifest as 
one ora few proximate, inverted breakpoint junctions with associated 
copy-number change, which we call ‘fold-back inversions” (Fig. 1). 
‘Chromoplexy*®”°—which is particularly frequent in prostate cancers— 
results from several simultaneous double-stranded DNA breaks in 
several chromosomes that are rejoined incorrectly, leading to balanced 
chains of rearrangements. ‘Chromothripsis”, in which chromosome 
shattering and rearrangement occur ina single catastrophic event?”, 
leads to a pattern of oscillating copy-number changes and localized 
clustering of tens to hundreds of breakpoints”. 

Inthe germline, more-complex copy-and-paste classes of structural 
variant have previously been described, which involve small duplica- 
tions andtriplications and are thought to arise from the stalling of the 
replication fork leading to template switching*”>”*. Here we describe 
a wide range of complex copy-and-paste types of somatic structural 
variant that occur inhuman cancers, and that are typically character- 
ized by copy-number gains and frequent inverted rearrangements. 


Annotation of structural-variant classes 


We analysed 2,559 whole cancer genomes across 38 tumour types 
(alongside matched germline DNA) that passed the most stringent 
PCAWG quality-control criteria: 1 or more somatic structural variants 
were detected in 2,429 tumours’. As described in an accompanying 
Article’, structural variants were identified using aberrantly map- 
ping and/or split reads in paired-end sequencing data’. We used four 
somatic structural-variant callers’? ”, and the final structural-variant 
dataset comprised events that were returned by =2 callers, merged 
by a graph-based consensus method®. We consider only somatically 
acquired structural variants in this analysis, and exclude somatic 
retrotransposition events. Validation of structural-variant calls was 
undertaken using both manual inspection and pull-down with rese- 
quencing of breakpoints. With these approaches, we estimate the 
sensitivity of the consensus structural-variant call set to be 90% for 
true calls generated by any lof the 4 callers; specificity was estimated 
as 97.5%8. A mean of 3.22 algorithms of the 4 that we used called each 
structural variant in the consensus set genome-wide, and this dif- 
fered little across repetitive elements: the mean for short interspersed 
nuclear elements was 3.22, and the mean for long interspersed nuclear 
elements was 3.21. 

Because the structural variants from a given cancer are often highly 
clustered, we grouped rearrangements into clusters onthe basis of the 
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Fig. 2| Frequency ofstructural-variant classes across tumour types. a, Violin 
plots of density of classified structural-variant categories across patients 
within each histology group. Tumour type panels are sorted in descending 
order of the average number of structural-variant breakpoints per sample. 
Within eachtumour type, the frequency distribution (y axis) of different 
structural-variant categories (x axis) across patients is shown asa density: 
regions of highest density have the greatest width of shaded area. Ineach 
panel, the number of patients is indicated at the top right. AdenoCA, 
adenocarcinoma; BNHL, B-cell non-Hodgkin lymphoma; ChRCC, 
chromophobe renal cell carcinoma; CLL, chronic lymphocytic leukaemia; CNS, 
central nervous system; GBM, glioblastoma; HCC, hepatocellular carcinoma; 
leiomyo, leiomyosarcoma; medullo, medulloblastoma; MPN, 
myeloproliferative neoplasm; eso, oesophageal; oligo, oligodendrocytic; panc, 
pancreatic; piloastro, pilocytic astrocytoma; prost, prostate; RCC, renal cell 
carcinoma; sarc, sarcoma; SCC, squamous cell carcinoma; TCC, transitional cell 
carcinoma; thy, thyroid. b, Per-sample counts of complex (bottom) and 
classified (top) structural-variant breakpoint junctions for oesophageal 
adenocarcinoma. c, Per-sample counts of complex (bottom) and classified 
(top) structural-variant breakpoint junctions for ovarian adenocarcinoma. 


proximity of breakpoints, the overall number of events in that genome 
and the size distribution of these events (Supplementary Methods). 
Essentially, a particular cluster contains structural variants that are 
significantly closer together than expected by chance, given the overall 
number and orientation of structural variants in that patient. Alongside 
the clustering, we computed anin silico library of all possible genomic 
configurations that result from sequential simple structural variants 
(deletions, tandem duplications, inversions, translocations, and chro- 
mosome duplications or losses), to a depth of five rearrangements. 
We could then compare the genomic configuration of each observed 
cluster of structural variants against the library to determine how it 
might have arisen. 

This methodology has the advantage that breakpoint junctions are 
classified according to the wider genomic context in which they occur. 
This means that, for example, true deletions will be identifiably dif- 
ferent from breakpoint junctions that happen to have a deletion-type 
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orientation but arise within (for instance) a chromothripsis event of 
markedly different mechanism and properties. Over half the break- 
point junctions that we observed arise within clusters of several or 
many structural variants (Fig. 2a): removing these junctions from the 
catalogues of true deletions, tandem duplications and inversions ena- 
bles a more-precise description of the properties of simple structural 
variants. 

Among the classes of simple structural variants, deletion was the 
most common, followed by tandem duplication and then unbalanced 
translocation. Reciprocal translocations and reciprocal inversions 
were uncommon events (Fig. 2a). There was considerable variability 
in the overall numbers and distribution of classes of structural variant 
across tumour types and across patients within a given tumour type 
(Extended Data Fig. 1). For example, oesophageal adenocarcinomas 
were characterized by many deletions and a large number of complex 
clustered rearrangements (Fig. 2b), and ovarian cancers often carried 
high numbers of tandem duplications and/or deletions with moderate 
numbers of unbalanced translocations (Fig. 2c). 


Cycles of templated insertions 


We next examined clusters that contain 2-10 structural variants. One 
newly identified configuration consisted of several segments of copy- 
number gains, typically on different reference chromosomes, linked 
together through structural variants (Fig. 3, Extended Data Fig. 2). 
A sequential path through consecutive segments can be formed by 
following the breakpoint junctions, which suggests that each clus- 
ter represents a string of duplicated templates inserted into a single 
derivative chromosome, probably acquired concurrently. Although it 
is theoretically possible that the structural variants in such clusters are 
not phased onthe same derivative chromosome or do not occur con- 
currently, we think this is unlikely for several reasons. First, we found 
examples of RNA transcripts that spliced together exons separated 
by two junctions in the structural-variant cluster (Supplementary 
Fig. 2), which suggests that they are phased on the same derivative 
chromosome. Second, long-read sequencing data (reported in an 
accompanying Article®) supported the phasing of structural variants 
that link templated insertions. Third, we found that the clonal frac- 
tion of tumour cells tended to be more similar for structural variants 
within these clusters than for randomly chosen structural variants 
in each patient (Supplementary Fig. 3), which suggests that they co- 
occur in evolutionary time. Fourth, the level of copy-number gain 
for individual segments in the cluster tended to be identical (Fig. 3, 
Extended Data Fig. 2). 

We define three basic categories on the basis of whether or not the 
string of inserted segments returns to the original chromosome: we 
term strings of inserted segments that do not return ‘chains’ of tem- 
plated insertions and those strings that do return ‘bridges’ (which 
leave a gap on the host chromosome) or ‘cycles’ (which rereplicate 
a segment on the host chromosome). In the PCAWG dataset overall, 
we observed 1,467 cycles and 1,275 bridges of templated insertions 
(Fig. 3a, b, Extended Data Fig. 2). In chains of templated insertions, 
the string of genomic segments does not return to the chromosome 
of departure (Fig. 3c, Extended Data Fig. 2) but it is similarly associ- 
ated with copy-number gains at each templated segment. There were 
285 instances of such chains in the dataset, commonly manifesting as 
unbalanced translocations joined through one or more intermediary 
templated insertions. 

Most templated insertion events involve only two breakpoint 
junctions, but this can extend to three, four or more linked rear- 
rangements (Extended Data Fig. 3a). The longest such event—from 
a cervical squamous cell cancer—had seven templated insertions 
strung together on an eighth host chromosome (Fig. 3c; other 
examples of long templated insertion events are shown in Extended 
Data Fig. 3). 
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Fig. 3| Chains, cycles and bridges of templated insertions. a—c, Examples ofa 
typical cycle (a), chain (b) and bridge (c) of templated insertions. The estimated 
copy-number profile is shown as in Fig. 1, with structural variants shown as 
dotted arcs linking two copy-number segments. The derivative 


Templated insertions that affect TERT 


Structural variants drive tumour development through their effects 
oncancer genes, whether by altering gene copy number, disrupting 
tumour-suppressor genes, creating fusion genes or juxtaposing the 
coding sequence of one gene with the regulatory apparatus of another. 
We found that many liver cancers had cycles of templated insertions 
that affect TERT (Fig. 3d, e, Extended Data Fig. 4). Point mutations in 
the TERT promoter are present in 54% of liver cancers, anda further 
5-10% of liver cancers have structural variants that activate the gene”’. 
Of the 30 patients with liver cancer that had structural variants that 
affect TERT, we find that 10 of these variants were templated inser- 
tion events (mostly cycles). All of these events duplicated the entire 


chromosome(s) that could explain the copy-number and structural-variant 
profile is shown below. d, e, Cycles of templated insertions that affect the TERT 
gene, intwo hepatocellular carcinomas. KIAA1024 is also known as MINARI. 


TERT gene and linked it to duplications of whole genes, fragments of 
genes or regulatory elements from elsewhere in the genome, and led 
to increased expression of TERT (Extended Data Fig. 4e). Thus, this 
particular rearrangement process is distinctive for the precision with 
which cancer copy-and-pastes normally disparate functional elements 
of its genome together without wholesale instability. 
Tumour-suppressor genes were also inactivated by templated inser- 
tions (Extended Data Fig. 5). For example, among many straightforward 
deletions, RBI was hit by cycles of templated insertions, a templated 
insertion with deletion and one instance of the linked, inverted dupli- 
cations detailed in ‘Local n-jumps and local-distant clusters’. These 
events typically generated duplications of internal exons in RBI and/or 
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Fig. 4 | Examples of clusters of 2-5 rearrangements seen in human cancers. a, 
Structures created by two local rearrangements that cannot easily be 
explained by simple structural-variant classes (which we call local 2-jumps). 
The estimated copy-number profile is shown as in Fig. 1, with structural 
variants shown as dotted arcs linking two copy-number segments. Possible 
configurations of the derivative chromosome are shown below; multiple 


insertions of exons from other genes, all of which presumably rendered 
anon-functional transcript. 


Local n-jumps and local-distant clusters 

Many clusters of 2-10 structural variants in the dataset were confined 
toasingle genomic region. Of those clusters that comprised two local 
rearrangements, some had straightforward explanations, such as 
nested or adjacent tandem duplications. However, many did not have 
atrivial explanation (Fig. 4a). These included a duplication-inverted- 
triplication—duplication structure that has previously been observed in 
germline structural variants” (349 instances); a structure of two dupli- 
cations linked by inverted rearrangements (531 instances); and struc- 
tures of copy-number loss plus nearby duplication linked by inverted 
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solutions are possible for each example. Dup, duplication; invDup, duplication 
linked by inverted rearrangement; trp, triplication. b, Structures created by 
3-4 local rearrangements that cannot easily be explained by simple structural- 
variant categories. c, Structures created by one local rearrangement and one 
rearrangement that reaches elsewhere in the genome (local-distant clusters). 


rearrangements (472 instances). All of these patterns had solutions in 
which breakpoints were phased to a single derivative chromosome 
(Fig. 4a), although non-phased solutions are theoretically possible 
(if unlikely). Beyond clusters of two rearrangements (two-jumps), we 
also found examples involving three, four or more rearrangements 
confined to one genomic locale (Fig. 4b). All of these configurations 
of clusters of structural variants can be phased to a single derivative 
chromosome, with tightly grouped breakpoints. 

Beyond clusters confined to a single genomic region, we found 
clusters of 2-10 structural variants that combined local jumps 
with rearrangements that reach into one or more distant regions 
of the genome (Fig. 4c). Simple examples of these events include 
unbalanced translocations or large deletions with a locally derived 
fragment inserted at the breakpoint, but there was also an extensive 
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Fig. 5| Size distribution and genomic properties of classified structural 
variants. a, Size distribution of deletions per histology group, with tumour 
types ordered according to total number of events seen. Vertical dashed lines 
represent the two prominent modes. b, Size distribution of segments of 
templated insertion per histology group. For each tumour type, the three 
distributions for cycles, bridges and chains of templated insertions are 
superimposed. Ins, insertion. c, Associations between a subset of the genomic 
properties (rows) and classes of structural variant (columns). Each density 
curve represents the quantile distribution of the genomic property values at 
observed breakpoints compared to random genome positions. Asterisks 
indicate a significant departure from uniform quantiles after multiple 
hypothesis correction ona one-sided Kolmogorov-Smirnov test based ona 
sample size of 2,559 genomes containing structural variants: *false-discovery 
rate < 0.01, **false-discovery rate < 0.001, ***false-discovery rate <10 °. Cells 
with significant property associations are shaded by the magnitude of the shift 
of the median observed quantile above (blue) or below (red) 0.5. The 


range of more-complex patterns. In some cases, the source of the 
inserted fragment was distal to the major break, and the structural vari- 
ant could feasibly result from several concurrent DNA breaks in close 
spatial proximity to the capture of ashort DNA fragment during repair 
(cut-and-paste). In other cases, the origin of the inserted fragment 
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interpretation of each property from left to right is indicated by the axes to the 
right of the property label. Complex uncl, complex clusters unclassified; cplxy, 
chromoplexy; del, deletion; inv, inversion; ins, insertion; LAD, lamina- 
associated domain; recip, reciprocal; TAD, topologically associated domain; 
TD, tandem duplication; trans, translocation; unbal, unbalanced. 

d, Rearrangement counts asa function of bases of junction microhomology, fit 
to three linear functions consistent with different formation mechanisms. 
NHEJ, non-homologous end joining; MMEJ, microhomology-mediated 

end joining; SSA, single-strand annealing. e, Enrichment or depletion of 
breakpoint junctions between regions of the genome with particular 
annotations, compared witha permuted background that preserves 
breakpoint positions but swaps breakpoint partners. Centre points are the 
mean fold change over the permuted background; error bars represent three 
s.d. Analysis is based onasample size of 2,559 genomes containing structural 
variants. LTR, long terminal repeat; SINE, short interspersed nuclear element; 
LINE, long interspersed nuclear element; heterochrom, heterochromatin. 


was proximal to the major break and associated with a gain in copy 
number. This pattern is difficult to explain by a cut-and-paste mecha- 
nism, because the copy-number gain implies the inserted segment 
was a duplicate of the original template rather than a separated frag- 
ment redistributed from its original locus. Instead, acopy-and-paste 
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mechanism may be the more parsimonious explanation for these 
events. 

A comparison of local footprints linked together through distant 
rearrangements revealed a strong connectivity of footprints with the 
same or similar structure, often enriched tenfold or more than expected 
by chance (see ‘Footprint connectivity analysis’ in Supplementary 
Results). The reasons for this are unclear, but it may reflect innate struc- 
tural symmetry introduced through the generation or the resolution 
of rearrangements, or through the repeated action of a mechanism 
that imparts consistent structural motifs. 


Copy-and-paste patterns of clusters 


The diverse patterns of 2-10 clustered structural variants (Figs. 3, 4) 
share important morphological features: (1) genomic configurations 
that can be phased to a single derivative chromosome; (2) low-level 
gains in copy number, especially duplications and triplications; (3) a 
high frequency of inverted rearrangements in addition to noninverted 
rearrangements; (4) occurrence on achromosome background with 
similar average copy number to the tumour overall; and (5) tight prox- 
imity of breakpoints within the local footprint (typically <1 Mb). 

Using our in silico library of genomic configurations, we could define 
all possible routes by which sequential structural variants could gener- 
ate these structures through the classically defined repertoire of dele- 
tion, tandem duplication, inversion and translocation (Supplementary 
Fig. 4). These routes typically would require implausible machinations 
of chromosomes (Supplementary Results). In particular, the high preva- 
lence of inverted breakpoint junctions and local copy-number gains is 
difficult to recreate using sequential simple rearrangements. Simple 
inversion events are uncommon in cancers (Fig. 1d) and they tend notto 
generate copy-number gains, except through breakage-fusion-bridge 
cycles: these latter also cause terminal deletions”, which are not seen 
in the events discussed here. 

If these events cannot be satisfactorily explained by sequential 
simple rearrangements, another possible explanation is a complex 
cut-and-paste mechanism such as chromothripsis, chromoplexy or 
repeated breakage-fusion-bridge cycles. However, the patterns of 
the 2-10 clustered structural variants do not fit with these processes 
either (Supplementary Results). Although chromothripsis with copy- 
number gain has previously been described*”””, the resulting copy 
number and rearrangement patterns have different properties to those 
we observed. Chromoplexy, in which chromosome breaks lead toa 
balanced interchange at multiple breakpoint junctions*”°, typically gen- 
erates unphased solutions. Repeated breakage-fusion—-bridge cycles 
tend to cause high-level copy-number gains associated with inverted, 
fold-back rearrangements?”, unlike the structures reported here. 

Instead, we believe that many of these locally complex clusters of 
structural variants with low-level copy-number gains are generated 
in a single event by a copy-and-paste process. That is, the copying of 
genomic templates is an intrinsic aspect of the structural variation pro- 
cess in these events, with the extra copies being inserted in the resulting 
derivative chromosome. If the genomic templates all originate locally, 
we would observe local n-jumps (suchas in Fig. 3a, b) witha tight cluster- 
ing of breakpoints, phased solutions, frequent copy-number gains and 
amix of inverted and noninverted breakpoint junctions. If the original 
templates for the copied segments derive from across the genome, 
chains, cycles and bridges of templated insertions would arise (Fig. 2). 


Genomic properties of structural variants 


The size of tandem duplications and deletions followed complex—often 
multimodal—distributions across tumour types (Fig. 5a, Extended 
Data Fig. 6a). However, as previously reported®”, individual patients 
tend to havea simpler—usually unimodal—distribution of deletions or 
tandem duplications (Extended Data Fig. 6b), which implies that the 
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complexity seenina given tumour type results from combining samples 
with different profiles. The sizes of individual fragments in templated 
insertion events were also distinctly multimodal, with varying peak 
heights across tumour types (Fig. 5b). When correlating template sizes 
within a given event, two patterns emerged: one in which template 
sizes were closely correlated with one another, and one in whichasmall 
(<1 kb) template was linked with one of any size (Extended Data Fig. 7a, 
b). Likewise, the sizes of segments within a given local two-jump event 
showed moderately strong correlations with one another (Extended 
Data Fig. 7c). 

Anumber of genomic properties (such as replication timing, tran- 
scriptional activity and chromatin state) influence the density of point 
mutations?! and copy-number alterations”, but how this relates to 
individual classes of structural variant is unclear. From the literature, 
we compiled a library of the genome-wide distribution of 38 features 
including replication timing, GC content, repeat density, gene density 
and distance to G-quadruplex motifs, among others. Replication timing 
had the strongest association with the occurrence of structural vari- 
ants; deletions are enriched in late-replicating regions, and tandem 
duplications and unbalanced translocations occur preferentially in 
early-replicating regions (Fig. 5c, Extended Data Fig. 8). For individual 
patients with high numbers of deletions or tandem duplications, we 
observed notable heterogeneity in the distribution of these struc- 
tural variants according to replication timing: some had events that 
occurred predominantly in late-replicating regions, others had events 
that occurred exclusively in early-replicating regions, and in others 
events were distributed more evenly (Supplementary Fig. 5). Regions 
of active chromatin and increased gene density correlated positively 
with the rate of rearrangement. 

A structural variant requires DNA repair pathways to join two 
sequences together, and several repair mechanisms are available to 
somatic cells. Some require sequence homology between the two ends, 
and others can operate to join non-homologous sequences. As previ- 
ously reported””>”?, we find across the PCAWG data that many structural 
variants do not have sequence homology at the breakpoint junction 
(Fig. 5d) and therefore arise through non-homologous end joining. 
Nonetheless, a sizable fraction of structural variants has more micro- 
homology than expected by chance, with an apparently bimodal dis- 
tribution of microhomology lengths. One set of structural variants 
has 2-7 bp of microhomology, probably generated by microhomol- 
ogy-mediated end joining, and a second set of structural variants has 
10-30 bp of microhomology, probably generated through single-strand 
annealing or other forms of homologous recombination (including 
microhomology-mediated break-induced replication). Repetitive 
sequences inthe genome, suchas short and long interspersed nuclear 
elements, are the likely substrate of such structural variants, and we 
find enrichment for structural variants joining such elements (Fig. 5e, 
Supplementary Fig. 6). 


Signatures of structural variation 


The heterogeneous spectrum of point mutations across cancers can 
be reconstructed from the differential action of a relatively limited 
repertoire of mutational processes, each with a characteristic signa- 
ture**. The differences across patients in the size distribution of tandem 
duplication and deletion—together with the widely varying frequency 
and patterns of structural variant across tumour types and genome 
topology—suggested that we could similarly learn such correlations 
across individual classes of structural variant. 

We divided the set of structural variants of each patient into mutu- 
ally exclusive categories. We split the most frequent classes of simple 
structural variant (deletions and tandem duplications) into 11 catego- 
ries according to size, replication timing and occurrence at fragile 
sites. Other configurations of structural variants and copy-number 
changes seen more than 50 times in the cohort were included as further 
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Fig. 6 | Structural-variant signatures in human cancers. a, The 12 most 
distinctive structural-variant signatures extracted by the Bayesian hierarchical 
Dirichlet process algorithm, run ona sample size of 2,559 genomes containing 
structural variants. Here the lengths of the bars represent the estimated 
proportion of each event class assigned to each signature (rows sum to one); 
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from the Markov chain. FB, fold-back; mid, mid-sized. b, Association of 
pathogenic mutations (germline and somatic combined) in key DNA repair 
genes with structural-variant signatures. The sample size of patients who have 


categories, including cycles, chains and bridges of templated insertions 
(also split by size), local n-jumps and local-distant clusters. 

We applied two methods for signature discovery, which yielded 
comparable results. We identified 16 structural-variant signatures: 
the 12 most prevalent of these signatures are shown in Fig. 6a. Signature 
extraction onthe cohort randomly split into two halves identified ten 
highly correlated signatures (Supplementary Fig. 7), which closely 
matched the signatures called in the full cohort despite the lower power. 
Three signatures of deletions emerged, split by size: the signature of 
small (<50-kb) deletions included small reciprocal inversions and the 
signature of large (>500-kb) deletions included large reciprocal inver- 
sions. This implies that the frequencies of deletions and reciprocal 
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pathogenic variants in the specific genes assessed is shown in brackets after 
each gene label (y axis). Hypothesis tests and effect sizes for each gene are 
derived from linear models for signature intensity after correction for 
histology. Significant associations from two-sided tests with correction for 
multiple hypothesis testing are shown. The colour and size of the points 
represent the estimated effect sizes. MSH refers to MSH2, MSH3, MSH4 and 
MSH6, genes in the mismatch repair pathway; FANC refers to genes associated 
with Fanconi anaemia, namely FANCA, FANCC, FANCD2, FANCE, FANCF, FANCG, 
FANCI, FANCL and FANCM. 


inversions are correlated across the cohort, and both follow similar 
size distributions within an individual patient. 

We identified five signatures of tandem duplications, split by size 
and replication timing. Cycles, bridges and chains of templated inser- 
tions were particularly prominent in signatures of early-replicating 
tandem duplications, whereas local two-jump structures were more 
closely associated with late-replicating tandem duplications. All of 
these patterns exemplify the copy-and-paste concept, in which extra 
copies of genomic templates are produced and inserted as an integral 
feature of the structural-variant process. 

Another signature was characterized by deletions and tandem dupli- 
cations at chromosomal fragile sites. Tandem duplications were more 
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prominent at the edges of the fragile site, and deletions were concen- 
trated in the centre (Extended Data Fig. 9a, b). The size range of fragile 
site deletions peaked at around 100 kb, similar to the larger deletion 
signature, whereas the rarer fragile-site tandem duplications showed 
no strong size peak (Extended Data Fig. 9c). Sites of fragility varied 
extensively across tumour types (Extended Data Fig. 9d). 

Unbalanced translocations comprised their own signature, which 
suggests that they derive from a distinct rearrangement process in 
cancer genomes. A further signature comprised both the fold-back 
inversions that are a hallmark of breakage-fusion-bridge cycles and 
similar structures such as translocations adjacent to fold-back inver- 
sions. Finally, there was a signature of balanced rearrangements, 
including reciprocal translocations and chromoplexy clusters>. This 
signature probably arises from several double-stranded DNA breaks 
(potentially occurring in interphase), in which both sides of the break 
are incorrectly repaired through ligation to other, simultaneously 
broken regions of the genome. 


DNA repair genes and tumour type 


We grouped annotations of pathogenic germline variants and somatic 
driver mutations in DNA-repair genes across the cohort®, correlat- 
ing their presence with activity of the structural-variant signatures 
(Fig. 6b). As previously described for breast and ovarian cancers®”, 
BRCA1 mutations are significantly associated with small tandem dupli- 
cation signatures, the mechanistic basis of which is increasingly well 
understood”. As previously described®**, CDK12 variants predicted 
signatures of mid-sized-to-large tandem duplications. BRCA2 variants 
correlated with small deletions, as expected from previous work”’, 
and also with the reciprocal structural-variant signature that includes 
chromoplexy. PALB2 variants showed the same correlations with sig- 
natures of small deletions and reciprocal structural variants as does 
BRCA2: PALB2 colocalizes with, stabilizes and assists BRCA2 during 
homologous recombination”, so we might have predicted that inactiva- 
tion of either gene would lead to a similar structural-variant signature. 
These associations between driver mutations and structural-variant 
signatures were consistently evident across many types of tumour 
(Extended Data Fig. 10). 

The structural-variant signatures showed considerable heterogene- 
ity in their activity across tumour types and among patients within a 
given tumour type (Supplementary Fig. 8). Tumours of the gastroin- 
testinal tract—including colorectal and oesophageal adenocarcino- 
mas-—showed high rates of the fragile-site signature. Prostate cancer 
was notable for the prevalence of the chromoplexy signature, as pre- 
viously reported*”°, and squamous cell carcinomas of the lung were 
characterized by the fold-back inversion signature. 

We assessed how classes of structural variant altered known cancer 
genes (Supplementary Table 1). Some cancer genes acquire oncogenic 
potential only with specific structural events, such as fusion genes or 
enhancer hijacking. Not surprisingly, these genes typically showed little 
variability in which classes of structural variant could generate such 
events (Extended Data Fig. 1la-c)—although there were exceptions. 
The TMPRSS2-ERG fusion gene of prostate cancer, for example, was 
generated by a range of processes (including simple deletions, chro- 
moplexy and chromothripsis), all of which are prevalent signatures in 
this tumour type (Extended Data Fig. 11d-f). 

Tumour-suppressor genes and recurrently amplified genes showed 
more variability in which types of structural variant were observed, 
and these were shaped by signatures active in the relevant tumour 
types. For example, the tumour-suppressor genes, PTEN and RADSIB, 
which are commonly inactivated in breast and ovarian cancers, were 
often targeted by tandem duplications generating out-of-frame exon 
duplications (Extended Data Fig. 12a, b). By contrast, deletions were 
the predominant events that inactivated SMAD4 and CDKN2A, in 
keeping with their prevalence in cancers of the gastrointestinal tract 
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(Extended Data Fig. 12c, d). MYC, one of the most commonly amplified 
genes across all types of cancer, showed considerable diversity in the 
mechanisms ofits rearrangement: nested tandem duplications in breast 
cancer, translocations or chromoplexy with/GH in lymphoma, as well 
as chromothripsis, cycles of templated insertions, local n-jumps and 
local-distant clusters in other types of tumour (Extended Data Fig. 13). 


Discussion 


We have described the patterns and signatures of structural variationin 
alarge cohort of uniformly analysed cancer genomes. A major grouping 
of patterns in structural variants that emerges from our study is onein 
which extra copies of genomic templates are inserted during the rear- 
rangement process. This includes simple events such as tandem dupli- 
cations, as well as a range of more-complex events with duplications and 
triplications that are rearranged locally as well as inserted distantly. Our 
signature analysis grouped a large proportion of these more-complex 
events together with tandem duplications, which suggests that they 
represent a continuum of processes that share underlying properties. A 
replication-based mechanism has previously been proposed to explain 
local two-jumps*”*™, in which stalled replication forks or other DNA 
lesions cause the DNA polymerase to switch templates and continue 
replication in anew location. Studies in experimental models are now 
revealing that a wide range of mechanisms and DNA lesions can result in 
templated insertions: these mechanisms include tandem duplications 
in BRCA1 deficiency”, translocations with templated insertions caused 
by dysregulated strand invasion*® and distant templated insertions in 
the absence of replication helicases”. 

Genomic instability in cancer is not a single phenomenon. Instead, 
many different mutational processes can act to restructure the genome 
and, in doing so, generate a notably flexible array of possible structures. 
Any given tumour draws ona subset of the available processes, shaped 
by the cell of origin, germline predisposition and other, unknown, fac- 
tors: selection then does the rest, promoting the clone that has chanced 
onthe structure that increases its potential for self-determination. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 

A detailed description of the methods used in this paper and many 
additional results are described in Supplementary Information. Here, 
we summarize the key aspects of the analysis. 


Generation of the structural-variant call set 

The final set of structural variants used in this Article was generated 
by the Technical Working Group of the PCAWG Consortium and is 
described in the main PCAWG paper®. In brief, four variant callers 
were used to identify somatically acquired structural variants from 
matched tumour and germline whole genome sequencing data: SVABA 
(Broad pipeline), DELLY (DKFZ pipeline), BRASS (Sanger pipeline) and 
dRanger (Broad pipeline). These were merged into a final call set using 
agraph-based algorithm to identify overlapping breakpoint junctions 
across algorithms. Detailed visual inspection of structural-variant calls 
suggested that a simple approach of accepting all structural-variant 
calls made by two or more of the four algorithms gave the best trade-off 
between sensitivity and specificity. 


Structural-variant clustering and annotation 

To identify clusters of structural variants, we developed a method 
for grouping structural variants into clusters and footprints to allow 
structural and mechanistic inferences to be made systematically. In 
parallel, we processed the somatic copy-number data and merged 
it with structural-variant junctions to enable us produce rearrange- 
ment patterns from the generated structural-variant clusters and 
footprints. We produced normalized representations of structural- 
variant cluster patterns, which enable us to tabulate the number of 
different cluster and footprint patterns and analyse their features. 
Finally, we performed manual and simulation-assisted interpretation 
of the recurrently observed cluster and footprint patterns. The indi- 
vidual steps of the structural-variant classification pipeline are outlined 
below and detailed in the subsequent subsections: (1) computing the 
exact breakpoint coordinates from clipped reads; (2) removing redun- 
dant ‘segment-bypassing’ structural variants; (3) merging rearrange- 
ment breakpoints with copy-number data to yield structural-variant 
breakpoint-demarcated, normalized, absolute copy-number data; (4) 
clustering individual structural variants into structural-variant clusters 
and footprints; (5) heuristically refining structural-variant clusters 
and footprints; (6) filtering artefactual fold-back-type structural vari- 
ants with insufficient support; (7) determining balanced overlapping 
breakpoints (this step is to distinguish very short templated insertions 
from mutually overlapping balanced breakpoints); and (8) computing 
rearrangement patterns and categories. 


Distribution of structural variants across the genome 

We divided the hg19 human reference genome (autosomes and chromo- 
some X) into 3,036,315 pixels of 1kb, and calculated a suite of metrics 
per pixel to summarize a variety of genome properties with potential 
relevance to the distribution of rearrangements, as listed in the Sup- 
plementary Information. Properties were matched as closely as possible 
to the tissue of origin for cancer samples from the PCAWG data. All 
other genome properties were held fixed across all tissues. To test for 
associations between structural-variant event classes and the library 
of genome properties, the genome property metrics were compared 
between real structural-variant positions (randomly choosing one side 
of each breakpoint junction to reduce dependence between observa- 
tions) and one million uniform random positions from the callable 
genome space. To compare the tissue-specific properties, each ran- 
dom position was assigned a random tissue type, drawing from the 
observed tissue-type distribution in the structural-variant call set. 


For each genome property and each event class, the real observations 
were pooled amongst the random ones, and then rank-transformed 
and normalized onascale from O to1. Under the null hypothesis of no 
event-versus-property association, the ranks of the real observations 
would follow a uniform distribution. We tested this in each case with 
a Kolmogorov-Smirnov test then applied a Benjamini-Yekutieli cor- 
rection for false-discovery rate across the entire suite of tests and set 
the threshold for significance reporting at 0.01. 


Structural-variant-signature analysis 

We used two algorithms for extracting structural-variant signatures. 
Both used the same input files, comprising a matrix of counts per 
patient (across all patients) of structural-variant clusters falling into 
anumber of mutually exclusive categories. These categories included 
the major classes of structural variants, with the more-common events 
(deletions, tandem duplications and inversions) split by size and/or 
replication timing. The two algorithms that were used for extracting 
the signatures were (1) a hierarchical Dirichlet process and (2) non- 
negative matrix factorization. Further details on the implementation 
of these algorithms are available in the Supplementary Information. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Somatic and germline variant calls, mutational signatures, subclonal 
reconstructions, transcript abundance, splice calls and other core data 
generated by the ICGC/TCGA PCAWG Consortium are described in an 
accompanying Article® and are available for download at https://dcc. 
icgc.org/releases/PCAWG. Additional information on accessing the 
data, including raw read files, can be found at https://docs.icgc.org/ 
pcawg/data/. In accordance with the data access policies of the ICGC 
and TCGA projects, most molecular, clinical and specimen data areinan 
open tier that does not require access approval. To access information 
that could potentially identify participants, such as germline alleles 
and the underlying sequencing data, researchers will need to apply to 
the TCGA data access committee via dbGaP (https://dbgap.ncbi.nlm. 
nih.gov/aa/wga.cgi?page=login) for access to the TCGA portion of the 
dataset, and to the ICGC data access compliance office (http://icgc.org/ 
daco) for the ICGC portion of the dataset. In addition, to access somatic 
single-nucleotide variants derived from TCGA donors, researchers will 
also need to obtain dbGaP authorization. 


Code availability 


The core computational pipelines used by the PCAWG Consortium 
for alignment, quality control and variant calling are available to the 
public at https://dockstore.org/search?search=pcawg under the GNU 
General Public License v.3.0, which allows for reuse and distribution. 
These are described in detail in an accompanying Article’. The code 
for grouping structural variants into structural-variant clusters and 
footprints is available at https://github.com/cancerit/ClusterSV/ (ver- 
sion 1.0). The code for simulating rearrangements can be found at 
https://github.com/cancerit/SimSvGenomes (version 1.0). The code 
for sampling from the hierarchical Dirichlet process for identification 
of mutational signatures is implemented as an R package at https:// 
github.com/nicolaroberts/hdp (version 0.1.1). 
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Extended Data Fig. 1| Per-sample counts of structural-variant breakpoint 
junctions by histology group. Counts of simple, classified structural variants 
are shown above the x axis and counts of complex breakpoint junctions below 


thexaxis. Patients within each tumour type are ranked by frequency of simple 
structural variants. 
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Extended Data Fig. 2 | Further examples of templated insertion chains, cycles and bridges. Schematics follow the same structure as in Fig. 3. 
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Extended Data Fig. 3 | Number of breakpoint junctionsincycles, bridgesand —_inalltumourtypesinthe cohort. b,c, Two examples of particularly long cycles 
chains of templated insertions. a, Histogram of numbers of breakpoint of templated insertions in the cohort. Examples are depicted ina similar 
junctions in templated insertion cycles, chains and bridges across all samples manner to those in Fig. 3. 
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Extended Data Fig. 4 | Templated insertion events that activate TERTin 
hepatocellular carcinoma. a, The positions of all structural-variant 
breakpoints inthe 7ERT region inthe PCAWG cohort (including 50-kb flanks 
either side of TERT), coloured by classification and vertically spaced by the 
distance to the next breakpoint in the cohort. If the two sides ofa breakpoint 
junction are contained within the plotting window, they are joined by acurved 
line. The number of samples with a breakpoint in the plotting windowis 
annotated in the table in the top left. b-d, Examples of two cycles anda chain of 
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TERT alteration status 


templated insertions that affect TERT in hepatocellular carcinomas. e, 
Expression levels of TERT in patients with hepatocellular carcinoma 

(n=187 patients), separated by whether TERT was wild type, had an activating 
promoter point mutation, structural variants ina templated insertion or other 
class. Individual patient data are shownas points. The box shows the median 
expression level asa thick black line, with the range of the box denoting the 
interquartile range. The whiskers show the range of data or 1.5x the 
interquartile range (whichever is lower). 
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Extended Data Fig. 5 | Templated insertion events inactivating RB1 in breast 
and ovarian carcinomas. a, The positions of all structural-variant breakpoints 
in the RBI region inthe PCAWG cohort (including 50-kb flanks either side of 


RB1), coloured by classification and vertically spaced by the distance to the 
next breakpoint in the cohort. If the two sides of a breakpoint junction are 


contained within the plotting window, they are joined by a curved line. The 
number of samples with a breakpoint in the plotting window is annotated inthe 
tablein the top left. b—e, Examples of three cycles and a bridge of templated 
insertions that affect RBI in breast and ovarian carcinomas. 
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Extended Data Fig. 8 | Relationship of an extended panel of genomic one-sided Kolmogorov-Smirnov test, based ona sample size of 2,559 genomes 
properties with structural-variant categories. Associations betweenasubset containing structural variants: *false-discovery rate < 0.01, **false-discovery 
of the genomic properties (rows) and classes of structural variant (columns). rate <0.001, ***false-discovery rate <10°°. Cells with significant property 

Each density curve represents the quantile distribution of the genomic associations are shaded by the magnitude of the shift of the median observed 
property values at observed breakpoints, compared to random genome quantile above (blue) or below (red) 0.5. The interpretation of each property 
positions. Asterisks indicate significant departures from uniform quantiles from left to right is indicated by the axes to the right of the property label. 
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Extended Data Fig. 9| Properties of structural variants at chromosomal 
fragile sites. a, Structural-variant breakpoints in the most affected fragile 
sites: FHIT, MACROD2 and WWOx. These are coloured by classification and 
vertically spaced by the distance to the next breakpoint in the cohort. Ifthe two 
sides of abreakpoint junction are contained within the plotting window, they 
are joined by acurved line. The number of samples with a breakpoint in the 
plotting windowis annotated in the tables at the top left. b, Number of 
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deletions and tandem duplications (top) and number of affected samples 
(bottom) for the 18 fragile sites considered in this analysis. c, Size distribution 
of deletions and tandem duplications in fragile sites (FS) compared tothe rest 
of the genome. d, Fragile-site preference for 20 cancer histology groups as 
indicated by the proportion of samples that contains a deletion in each of the 
18 fragile sites considered here. The number of samples is indicated in 
parentheses. 
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Extended Data Fig. 10 | Consistency of associations between signatures and ovarian, prostate, lung squamous and so on). b, Box-and-whisker plots as fora, 
mutations in DNA-repair genes. a, Box-and-whisker plotsshowingthenumber —showingthenumber of structural variants attributed to the small-deletion 


of structural variants attributed to the small-deletion signature in different signature in different types of tumour, split by PALB2 status. c, Box-and-whisker 
types of tumour, split by BRCA2 status (BRCA2 wild type in orange; plots as fora, showing the number of structural variants attributed to the early- 
BRCA2 mutant in cyan). The box denotes the interquartile range, withthe replicating, small-tandem-duplication signature in different types of tumour, 
median marked asa horizontal line. The whiskers extend as far as the range or split by BRCA1 status. d, Box-and-whisker plots as for a, showing the number of 
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Extended Data Fig. 11| See next page for caption. 


chri0 


i 
chr14 


chr8 


Extended Data Fig. 11| Patterns of structural variants causing fusion genes 
and enhancer hijacking. a, Rainfall plot of structural-variant breakpoints in 
the genes KIAA1549 and BRAF, commonly fused together through atandem 
duplication in pilocytic astrocytomas. Structural variants are coloured by 
classification and arranged vertically by the distance to the next breakpoint in 
the cohort. Ifthe two sides of a breakpoint junction are contained within the 
plotting window, they are joined by acurved line. The number of samples witha 
breakpoint in the plotting window is annotated in the table at the top of each 
panel. b, Rainfall plot of structural-variant breakpoints that affect RET, 
commonly fused to CCDC6 by inversion in papillary thyroid cancer. c, Rainfall 


plot of structural-variant breakpoints that affect BCL2, commonly hijacked to 
the /GH immunoglobulin locus by translocations in B cell lymphomas. d, 
Rainfall plot of structural-variant breakpoints that affect ERG, commonly fused 
with TMPRSS2by deletion or more-complex events in prostate 
adenocarcinoma. e, Example of a TMPRSS2-ERG fusion gene ina prostate 
adenocarcinoma created bya chromoplexy cycle. The estimated copy-number 
profile is shownas black horizontal segments, with structural variants shown 
as dotted arcs linking the edges of two copy-number segments. f, Example ofa 
TMPRSS2-ERG fusion gene ina prostate adenocarcinoma created by 
chromothripsis. 
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Extended Data Fig. 13 | Examples of structural variants increasing the copy number of MYC. The estimated copy-number profile is shown as black horizontal 
segments, with structural variants shown as dotted arcs linking the edges of two copy-number segments. 
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Extended Data Table 1| Glossary of key terms 


Term 


Description 


Structural variant (SV) 


Breakpoint 


Copy number 
alteration (CNA) 


Reciprocal or balanced 
SV 


Juxtaposition of non-contiguous chromosomal segments through a process of genomic 
rearrangement. 


The chromosomal position at which a DNA break is made. Each SV consists of a junction 
between two breakpoints in different regions of the genome. 


Change in the number of copies of a given chromosomal segment from that expected. 


A pair of SVs in which both sides of a single dsDNA break are rescued in the 
rearrangement. Typically used to describe some inversions and some translocations. 


Unbalanced SV An SV (usually inversion or translocation) in which only one side of the dsDNA break is 
rescued, thereby generating a copy number alteration across the breakpoint. 

Cluster of SVs A set of SVs that are closer together in genomic space than expected by chance. Typically, 
such clustering implies a shared mechanistic basis for the SV generation. 

Derivative A chromosome that carries one or more SVs. 

chromosome 

Phased SVs Set of SVs and copy number alterations in a cluster carried on a single derivative 
chromosome. 

Chromosomal A contiguous stretch of DNA that is of constant copy number, used to denote the regions 

segment of chromosome between SVs. 

Template A region of chromosomal DNA that is copied and inserted elsewhere in the genome. 

SV class A type of structural variant, such as deletion, tandem duplication or translocation. 

Deletion Loss of a segment of chromosome from the genome spanned by a junction between the 


Tandem duplication 


Reciprocal inversion 


Fold-back inversion 


Translocation 


Breakage-fusion- 
bridge cycle 


Chromoplexy 


Chromothripsis 


Local n-jump 


Cycle, chain or bridge 
of templated insertions 


Local-distant cluster 


two breakpoints either side. 


Extra copy of a segment of chromosome in which the duplicated region is inserted 
immediately adjacent to the original template in the same orientation. 


A segment of chromosomal DNA inserted into its original position, but in the opposite 
orientation. 


An inverted rearrangement between two breakpoints typically <20kb apart on the 
chromosome, with associated copy number change. Often a sign of breakage-fusion- 
bridge cycles. 


Breakpoint junction between two different chromosomes, either reciprocal or unbalanced. 


SV mechanism in which a naked DNA end (breakage) is copied to its sister chromatid 
during S phase, with the two ends undergoing fusion (by fold-back inversion). At 
anaphase, the resulting dicentric chromosome is stretched between the two daughter cells 
(bridge), leading to further DNA breakage and potentially further cycles. 


A set of >2 reciprocal SVs in which the chromosomal ends either side of each breakpoint 
are shuffled such that every end is rescued in a rearrangement junction. 


A cluster of many SVs (10s to 100s) in one or a few chromosomes, occurring in a single 
catastrophic event, with oscillating copy number profile and rearrangement junctions of all 
four possible orientations. 


A cluster of n SVs in a single genomic region, typically phased to a single derivative 
chromosome, exhibiting some copy number gains and junctions with inverted and non- 
inverted orientation. 


Copies of one or more genomic templates drawn from across the genome, strung together 
in a contiguous string and inserted into a single derivative chromosome. A chain of 
templated insertions does not return to the original chromosome, leading to an unbalanced 
translocation. A cycle has a duplication on the host chromosome, while a bridge inserts 
the template copies into a deletion on the host chromosome. 


A cluster of SVs that has both local rearrangements and rearrangements to other parts of 
the genome. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
Lo AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
— Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Workflow Step Algorithm Version Dockstore Package* 
WGS Alignment BWA-MEM v0.7.8-r455 https://goo.gl/oqp4Xxd 
EMBL/DKFZ SV caller DELLY v0.6.6 https://goo.gl/Y46MCo 
EMBL/DKFZ SCNA caller ACEseq v1.0.189 https://goo.gl/4z0V42 
EMBL/DKFZ SNV caller DKFZ somatic SNV workflow 1.0.132-1 
EMBL/DKFZ indel caller Platypus v0.7.4 
Sanger SCNA caller ascatNgs v1.5.2 https://goo.gl/9DSrbA 
Sanger SV caller BRASS v4.012 
grass v1.1.6 
Sanger SNV caller CaVEMan v1.50 
Sanger indel caller Pindel v1.5.7 
Broad SCNA caller ABSOLUTE/JaBbA v1.5/? https://goo.gl/YkdtDt 
Broad SV caller SVABA/dRanger/BreakPointer 2015-05-20/2016-03-13/2015-12-22 
Broad SNV caller MuTect v1.1.4 
Broad indel caller SVABA 2015-05-20 
MuSE SNV caller MuSE v1.0rc https://goo.gl/SSR4bF 
SMUFIN indel caller SMuFIN 2014-10-26 https://goo.gl/EuUP5k 
Oxidative artefact filter OxoG 2016-4- 28 https://goo.gl/cUKP9K 
SNV/Indel annotation VAGrENT v2.1.2 https://goo.gl/9DSrbA 
ANNOVAR v2014Nov12 https://goo.gl/4z0V42 
miniBAM generation VariantBAM v2017Dec12 https://goo.gl/S8h8e5 
SNV/Indel merging and consensus generation SNV-MERGE v2017May26 https://goo.gl/TETSB8 
SV merging and consensus generation SV-MERGE v2017Dec12 https://goo.gl/A9CEup 
Strand bias filter DKFZ Strand Bias Filter v2016Dec15 https://goo.gl/8jXrvZ 
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Data analysis We have described the algorithms in detail throughout the manuscript. Signatures analysis used the published NMF algorithm (PMID: 
23318258) and an unpublished Bayesian method, released on github (https://github.com/nicolaroberts/hdp). 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


The PCAWG-generated alignments, variant calls, annotations and derived data sets are available for general research use for browsing and download at http:// 
dcc.icgc.org/pcawg/. In accordance with the data access policies of the ICGC and TCGA projects, most molecular, clinical and specimen data are in an open tier 
which does not require access approval. To access potentially identifying information, such as germline alleles and underlying read data, researchers will need to 
apply to the TCGA Data Access Committee (DAC) via dbGaP (https://dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?page=login) for access to the TCGA portion of the data set, 
and to the ICGC Data Access Compliance Office (DACO; http://icgc.org/daco) for the ICGC portion. 
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Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 
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For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size We compiled an inventory of matched tumour/normal whole cancer genomes in the ICGC Data Coordinating Centre. Most samples came 
from treatment-naive, primary cancers, but there were a small number of donors with multiple samples of primary, metastatic and/or 
recurrent tumours. Our inclusion criteria were: (i) matched tumour and normal specimen pair; (ii) a minimal set of clinical fields; and (iii) 
characterisation of tumour and normal whole genomes using IIlumina HiSeq paired-end sequencing reads. 
We collected genome data from 2,834 donors, representing all ICGC and TCGA donors that met these criteria at the time of the final data 
freeze in autumn 2014. 


No formal power calculations were performed to decide sample size for PCAWG - we aggregated all genomes available at the time. 


Data exclusions After quality assurance, data from 176 donors were excluded as unusable. Reasons for data exclusions included inadequate coverage, 
extreme bias in coverage across the genome, evidence for contamination in samples and excessive sequencing errors (for example, through 8- 
oxoguanine). Data exclusion criteria were pre-established. 


Replication In order to evaluate the performance of each of the mutation-calling pipelines and determine an integration strategy, we performed a large- 
scale deep sequencing validation experiment. We selected a pilot set of 63 representative tumour/normal pairs, on which we ran the three 
core pipelines, together with a set of 10 additional somatic variant-calling pipelines contributed by members of the SNV Calling Working 
Group. To assess accuracy of SV calls, we therefore used the property that an SV must either generate a copy number change or be balanced, 
whereas artefactual calls will not respect this property. For individual SV callers, we estimated the true positive rate to be in the range 80-95% 
for samples in the pilot-63 dataset. 


Randomization — Not applicable - this was a descriptive study. 


Blinding Not applicable - this was a descriptive study. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 
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Population characteristics Patient-by-patient clinical data are provided in the mar 


Recruitment Patients were recruited by the participating centres fol 


Ethics oversight 


(range, 1-90 years). Using population ancestry-differen 


tumour samples, using the ICD-O-3 tumour site control 


types. While the most common tumour types are included in the dataset, t 


er paper for the PCAWG consortium (Extended Data Table 1 of that 
manuscript). Demographically, the cohort included 1,469 males (55%) and 1,189 females (45%), with a mean age of 56 years 
iated single nucleotide polymorphisms (SNPs), the ancestry distribution 
was heavily weighted towards donors of European descent (77% of total) followed by East Asians (16%), as expected for large 
contributions from European, North American and Australian projects. We consolidated histopathology descriptions of the 

ed vocabulary. Overall, the PCAWG data set comprises 38 distinct tumour 


heir distribution does not match the relative 


population incidences, largely due to differences among contributing ICGC/TCGA groups in numbers sequenced. 


owing local protoco 


s. Samples obtained had to meet criteria on amount 


of tumour DNA available, meaning that the cohort is potentially somewhat biased towards larger tumours. Otherwise, we 


anticipate no major recruitment biases. 


Committee of the ICGC. 


The Ethics oversight for the PCAWG protocol was undertaken by the TCGA 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Cancer develops througha process of somatic evolution’”. Sequencing data froma 
single biopsy represent a snapshot of this process that can reveal the timing of specific 
genomic aberrations and the changing influence of mutational processes’. Here, by 
whole-genome sequencing analysis of 2,658 cancers as part of the Pan-Cancer 
Analysis of Whole Genomes (PCAWG) Consortium of the International Cancer 
Genome Consortium (ICGC) and The Cancer Genome Atlas (TCGA)*, we reconstruct 
the life history and evolution of mutational processes and driver mutation sequences 
of 38 types of cancer. Early oncogenesis is characterized by mutations in a constrained 
set of driver genes, and specific copy number gains, such as trisomy 7 in glioblastoma 
and isochromosome 17q in medulloblastoma. The mutational spectrum changes 
significantly throughout tumour evolution in 40% of samples. A nearly fourfold 
diversification of driver genes and increased genomic instability are features of later 
stages. Copy number alterations often occur in mitotic crises, and lead to 
simultaneous gains of chromosomal segments. Timing analyses suggest that driver 
mutations often precede diagnosis by many years, if not decades. Together, these 
results determine the evolutionary trajectories of cancer, and highlight opportunities 
for early cancer detection. 


Similar to the evolution in species, the approximately 10” cells in the 
human body are subject to the forces of mutation and selection’. This 
process of somatic evolution begins in the zygote and only comes to 
rest at death, as cells are constantly exposed to mutagenic stresses, 
introducing 1-10 mutations per cell division”. These mutagenic forces 
lead to a gradual accumulation of point mutations throughout life, 
observed ina range of healthy tissues” “ and cancers”. Although these 
mutations are predominantly selectively neutral passenger mutations, 


about the times when these lesions arise during somatic evolution and 
where the boundary between normal evolution and cancer progression 
should be drawn. 

Sequencing of bulk tumour samples enables partial reconstruction of 
the evolutionary history of individual tumours, based on the catalogue 
of somatic mutations they have accumulated**. These inferences 
include timing of chromosomal gains during early somatic evolution’®, 
phylogenetic analysis of late cancer evolution using matched primary 


and metastatic tumour samples from individual patients” ”°, and tem- 
poral ordering of driver mutations across many samples”), 


some are proliferatively advantageous driver mutations”. The types 
of mutation in cancer genomes are well studied, but little is known 
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Fig. 1| Timing clonal copy number gains using allele frequencies of point 
mutations. a, Principles of timing mutations and copy number gains based on 
whole-genome sequencing. The number of sequencing reads reporting point 
mutations can be used to discriminate variants as early or late clonal (green or 
purple, respectively) in cases of specific copy number gains, as well as clonal 
(blue) or subclonal (red) in cases without. b, Annotated point mutationsin one 
sample based on VAF (top), copy number (CN) state and structural variants 
(middle), and resulting timing estimates (bottom). LOH, loss of heterozygosity. 
c, Overview of the molecular timing distribution of copy number gains across 
cancer types. Pie charts depict the distribution of the inferred mutation time 
for agiven copy number gain ina cancer type. Green denotes early clonal gains, 
witha gradient to purple for late gains. The size of each chart is proportional to 
the recurrence of this event. Abbreviations for each cancer type are defined 


The PCAWG Consortium has aggregated whole-genome sequenc- 
ing data from 2,658 cancers’, generated by the ICGC and TCGA, and 
produced high-accuracy somatic variant calls, driver mutations, and 
mutational signatures*”?™ (Methods and Supplementary Information). 

Here, we leverage the PCAWG dataset to characterize the evolution- 
ary history of 2,778 cancer samples from 2,658 unique donors across 
38 cancer types. We infer timing and patterns of chromosomal evolu- 
tion and learn typical sequences of mutations across samples of each 
cancer type. We then define broad periods of tumour evolution and 
examine how drivers and mutational signatures vary between these 
epochs. Using clock-like mutational processes, we map mutation timing 
estimates into approximate real time. Combined, these analyses allow 
ustosketch out the typical evolutionary trajectories of cancer, and map 
them in real time relative to the point of diagnosis. 


Reconstructing the life history of tumours 


The genome of acancer cellis shaped by the cumulative somatic aber- 
rations that have arisen during its evolutionary past, and part of this 
history can be reconstructed from whole-genome sequencing data? 
(Fig. 1a). Initially, each point mutation occurs ona single chromosome 


in Supplementary Table1.d, Heat maps representing molecular timing 
estimates of gains on different chromosome arms (x axis) for individual 
samples (y axis) for selected tumour types. e, Temporal patterns of two 
near-diploid cases illustrating synchronous gains (top) and asynchronous 
gains (bottom). f, Left, distribution of synchronous and asynchronous gain 
patterns across samples, split by WGD status. Uninformative samples 

have too few or too small gains for accurate timing. Right, the enrichment 

of synchronous gains in near-diploid samples is shown by systematic 
permutation tests. g, Proportion of copy number segments (n= 90,387) with 
secondary gains. Error bars denote 95% credible intervals. ND, near diploid. 
h, Distribution of the relative latency of n=824 secondary gains with available 
timing information, scaled to the time after the first gain and aggregated per 
chromosome. 


ina single cell, which gives rise to a lineage of cells bearing the same 
mutation. If that chromosomal locus is subsequently duplicated, any 
point mutation on this allele preceding the gain will subsequently be 
present onthe two resulting allelic copies, unlike mutations succeeding 
the gain, or mutations onthe other allele. As sequencing dataenablethe 
measurement of the number of allelic copies, one can define categories 
of early and late clonal variants, preceding or succeeding copy number 
gains, as well as unspecified clonal variants, which are common to all 
cancer cells, but cannot be timed further. Lastly, we identify subclonal 
mutations, whichare present in only a subset of cells and have occurred 
after the most recent common ancestor (MRCA) of all cancer cells in 
the tumour sample (Supplementary Information). 

The ratio of duplicated to non-duplicated mutations within a gained 
region can be used to estimate the time point when the gain hap- 
pened during clonal evolution, referred to here as molecular time, 
which measures the time of occurrence relative to the total num- 
ber of (clonal) mutations. For example, there would be few, if any, 
co-amplified early clonal mutations if the gain had occurred right 
after fertilization, whereas a gain that happened towards the end of 
clonal tumour evolution would contain many duplicated mutations 
(Fig. la, Methods). 
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Fig. 2| Timing of point mutations shows that recurrent driver gene 
mutations occur early. a, Top, distribution of point mutations over different 
mutation periods inn=2,778 samples. Middle, timing distribution of driver 
mutations inthe 50 most recurrent lesions across n= 2,583 white listed 
samples from unique donors. Bottom, distribution of driver mutations across 
cancer types; colour as defined in the inset. b, Relative timing of the 50 most 
recurrent driver lesions, calculated as the odds ratio of early versus late clonal 
driver mutations versus background, or clonal versus subclonal. Error bars 
denote 95% confidence intervals derived from bootstrap resampling. Odds 
ratios overlapping lin less than 5% of bootstrap samples are considered 
significant (coloured). The underlying number of samples witha given 
mutation is shown ina.c, Relative timing of 7P53 mutations across cancer 
types, asin b. The number of samples is defined in the x-axis labels. 

d, Estimated number of unique lesions (genes) contributing 50% of all driver 
mutations in different timing epochs across n=2,583 unique samples, 
containing n=5,756 driver mutations with available timing information. Error 
bars denote the range between 0 and 1 pseudocounts; bars denote the average 
of the two values. NA, not applicable; NS, not significant. 


These analyses are illustrated in Fig. 1b. As expected, the variant 
allele frequencies (VAFs) of somatic point mutations cluster around 
the values imposed by the purity of the sample, local copy number 
configuration and identified subclonal populations. The depicted 
clear cell renal cell carcinoma has gained chromosome arm 5q at an 
early molecular time as part of an unbalanced translocation t(3p;5q), 
whichconfirms the notion that this lesion often occurs in adolescence 
inthis cancer type’. Ata later time point, the sample underwent a whole 
genome duplication (WGD) event, duplicating all alleles, including the 
derivative chromosome, ina single event, as evidenced by the mutation 
time estimates of all copy number gains clustering around asingle time 
point, independently of the exact copy number state. 


Timing patterns of copy number gains 


To systematically examine the mutational timing of chromosomal gains 
throughout the evolution of tumours inthe PCAWG dataset, we applied 
this analysis to the 2,116 samples with copy number gains suitable for 
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timing (Supplementary Information). We find that chromosomal gains 
occur across a wide range of molecular times (median molecular time 
0.60, interquartile range (IQR) 0.10-0.87), with systematic differences 
between tumour types, whereas within tumour types, different chro- 
mosomes typically show similar distributions (Fig. 1c, Extended Data 
Figs. 1, 2, Supplementary Information). In glioblastoma and medul- 
loblastoma, a substantial fraction of gains occurs early in molecular 
time. By contrast, in lung cancers, melanomas and papillary kidney 
cancers, gains arise towards the end of the molecular timescale. Most 
tumour types, including breast, ovarian and colorectal cancers, show 
relatively broad periods of chromosomal instability, indicating a very 
variable timing of gains across samples. 

There are, however, certain tumour types with consistently early 
or late gains of specific chromosomal regions. Most pronounced is 
glioblastoma, in which 90% of tumours contain single copy gains of 
chromosome 7, 19 or 20 (Fig. 1c, d). Notably, these gains are consistently 
timed within the first 10% of molecular time, which suggests that they 
arise very early ina patient’s lifetime. In the case of trisomy 7, typically 
less than 3 out of 600 single nucleotide variants (SNVs) on the whole 
chromosome precede the gain (Extended Data Fig. 3a, b). On the basis 
of a mutation rate of 1 = 4.8 x 10°” to 3.0 x 10° SNVs per base pair per 
division®, this indicates that the trisomy occurs within the first 6-39 
cell divisions, suggesting a possible early developmental origin, in 
agreement with somatic mosaicisms observed in the healthy brain”®. 
Similarly, the duplications leading to isochromosome 17q in medul- 
loblastoma are timed exceptionally early (Extended Data Fig. 3c, d). 

Notably, we observed that gains in the same tumour often appear 
to occur at a similar molecular time, pointing towards punctuated 
bursts of copy number gains involving most gained segments (Fig. le). 
Although this is expected in tumours with WGD (Fig. 1b), it may seem 
surprising to observe synchronous gains in near-diploid tumours, par- 
ticularly as only 6% of co-amplified chromosomal segments were linked 
by a direct inter-chromosomal structural variant. Still, synchronous 
gains are frequent, occurring in 57% (468 out of 815) of informative 
near-diploid tumours, 61% more frequently than expected by chance 
(P<0.01, permutation test; Fig. 1f). Because most arm-level gains incre- 
ment the allele-specific copy number by 1 (80-90%; Fig. 1g), it seems 
that these gains arise through mis-segregation of single copies during 
anaphase. This notion is further supported by the observation that in 
about 85% of segments with two gains of the same allele, the second 
gain appears with noticeable latency after the first (Fig. 1h). Therefore, 
the extensive chromosome-scale copy number aberrations observed 
in many cancer genomes are seemingly caused by a limited number 
of events—possibly by merotelic attachments of chromosomes to 
multipolar mitotic spindles”, or as a consequence of negative selection 
of individual aneuploidies”’— offering an explanation for observations 
of punctuated evolution in breast and colorectal cancer?””°. 


Timing of point mutations in driver genes 


As outlined above, point mutations (SNVs and insertions and deletions 
(indels)) can be qualitatively assigned to different epochs, allowing 
the timing of driver mutations. Out of the 47 million point mutations 
in 2,583 unique samples, 22% were early clonal, 7% late clonal, 53% 
unspecified clonal and 17% subclonal (Fig. 2a). Among a panel of 453 
cancer driver genes, 5,913 oncogenic point mutations were identified’, 
of which 29% were early clonal, 5% late clonal, 56% unspecified clonal 
and 8% subclonal. It thus emerges that common drivers are enriched in 
the early clonal and unspecified clonal categories and depleted inthe 
late clonal and subclonal ones, indicating a preferential early timing 
(Fig. 2b). For example, driver mutations in 7P53 and KRAS are 12 and 8 
times enriched in early clonal stages, respectively. For 7P53, this trend 
is independent of tumour type (Fig. 2c). Mutations in PIK3CA are two 
times more frequently clonal than expected, and non-coding changes 
near the TERT gene are three times more frequently early clonal. 
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Fig. 3 | Aggregating single-sample ordering reveals typical timing of 

driver mutations. a, Schematic representation of the ordering process. 

b-d, Examples of individual patient trajectories (partial ordering relationships), 
the constituent data for the ordering model process. e-g, Preferential ordering 
diagrams for colorectal adenocarcinoma (ColoRect-AdenoCA) (e), pancreatic 


Aggregating the clonal status of all driver point mutations over 
time reveals an increased diversity of driver genes mutated at 
later stages of tumour development: 50% of all early clonal driver 
mutations occur in just 9 genes, whereas 50% of late and subclonal 
mutations occur in approximately 35 different genes each, a nearly 
fourfold increase (Fig. 2d). Consistent with previous studies of indi- 
vidual tumour types” ™, these results suggest that, in general, the 
very early events in cancer evolution occur ina constrained set of 
common drivers, and a more diverse array of drivers is involved in 
late tumour development. 


Relative timing of somatic driver events 


Although timing estimates of individual events reflect evolutionary 
periods that differ from one sample to another, they define in part 
the order in which driver mutations and copy number alterations have 
occurred in each sample (Fig. 3a—d). As confirmed by simulations, 
aggregating these orderings across samples defines a probabilistic 
ranking of lesions (Fig. 3a), recapitulating whether each mutation 
occurs preferentially early or late during tumour evolution (Extended 
Data Figs. 4, 5, Supplementary Information). 

In colorectal adenocarcinoma, for example, we find APC mutations 
to have the highest odds of occurring early, followed by KRAS, loss 
of 17p and 7P53, and SMAD4 (Fig. 3b, e). Whole-genome duplications 


Early «> Late/subclonal 


-/-9p21.3 
ep 
+7p11. 
~/-10923.31 — 
RBI 


15 20 25 1 5 10 15 20 25 
Early «—————-> Late/subclonal 


Relative rank (preference) 


*Colours for each mutation type 
m= Coding mutation 
m Copy number gain 
m Copy number loss 


m Promoter mutation 
= Copy number loss post WGD 
m Homozygous copy number loss 


neuroendocrine cancer (Panc-Endocrine) (f) and glioblastoma (CNS-GBM) (g). 
Probability distributions show the uncertainty of timing for specific 

events inthe cohort. Events with odds above 10 (either earlier or later) are 
highlighted. The prevalence of the event type in the cohort is displayed as a bar 
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occur after tumours have accumulated several driver mutations, 
and many chromosomal gains and losses are typically late. These 
results are in agreement with the classical APC-KRAS-TP53 pro- 
gression model of Fearon and Vogelstein®, but add considerable 
detail. 

In many cancer types, the sequence of events during cancer progres- 
sion has not previously been determined in detail. For example, in 
pancreatic neuroendocrine cancers, we find that many chromosomal 
losses, including those of chromosomes 2, 6, 11 and 16, are among 
the earliest events, followed by driver mutations in MENI and DAXX 
(Fig. 3c, f). WGD events occur later, after many of these tumours have 
reached a pseudo-haploid state due to widespread chromosomal 
losses. In glioblastoma, we find that the loss of chromosome 10, and 
driver mutations in 7P53 and EGFR are very early, often preceding 
early gains of chromosomes 7, 19 and 20 (Fig. 3d, g). Mutations in the 
TERT promoter tend to occur at early to intermediate time points, 
whereas other driver mutations and copy number changes tend to 
be later events. 

Across cancer types, we typically find 7P53 mutations among the 
earliest events, as well as losses of chromosome 17 (Supplementary 
Information). WGD events usually have an intermediate ranking, and 
most copy number changes occur later. Losses typically precede gains, 
and consistent with the results above, common drivers typically occur 
before rare drivers. 
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Fig. 4| Dynamic mutational processes during early and late clonal tumour 
evolution. a, Example of tumours with substantial changes between mutation 
spectra of early (left) and late (right) clonal time points. The attribution of 
mutations to the most characteristic signatures are shown. b, Example of clonal- 
to-subclonal mutation spectrum change. c, Fold changes between relative 
proportions of early and late clonal mutations attributed to individual 
mutational signatures. Points are coloured by tissue type. Data are shown for 
samples (n= 530) with measurable changes in their overall mutation spectra and 
restricted to signatures active in at least 10 samples. Box plots demarcate the first 
and third quartiles of the distribution, with the median shown in the centre and 
whiskers covering data within 1.5x the IQR from the box. d, Fold changes between 
clonal and subclonal periods in samples (n= 729) with measurable changes in 
their mutation spectra, analogous toc. 


Timing of mutational signatures 
The cancer genome is shaped by various mutational processes over 
its lifetime, stemming from exogenous and cell-intrinsic DNA dam- 
age, and error-prone DNA replication, leaving behind characteristic 
mutational spectra, termed mutational signatures™”**. Stratifying 
mutations by their clonal allelic status, we find evidence for a changing 
mutational spectrum between early and late clonal time points in 29% 
(530 out of 1,852) of informative samples (P< 0.05, Bonferroni-adjusted 
likelihood-ratio test), typically changing the spectrum by 19% (median 
absolute difference; range 4-66%) (Fig. 4a, b, Extended Data Fig. 6). 
Similarly, 30% of informative samples (729 out of 2,387) displayed 
changes of their mutation spectrum between the clonal and subclonal 
state, with median difference of 21% (range 3-72%). Combined, the 
mutation spectrum changes throughout tumour evolution in 40% of 
samples (1,069 out of 2,688). 

To quantify whether the observed temporal changes can be attrib- 
uted to known and suspected mutational processes, we decom- 
posed the mutational spectra at each time point into a catalogue of 
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57 mutational signatures, including double base substitution and indel 
signatures“ (Methods). 

In general, these mutational signatures display a predominantly 
undirected temporal variability over several orders of magnitude 
(Fig. 4c, d, Extended Data Fig. 7). In addition, several signatures dem- 
onstrate distinct temporal trends. As one may expect, signatures of 
exogenous mutagens are predominantly active in the early clonal 
stages of tumorigenesis. These include tobacco smoking in lung adeno- 
carcinoma (signature SBS4, median fold change 0.43, IQR 0.31-0.72), 
consistent with previous reports”, and ultraviolet light exposure in 
melanoma (SBS7; median fold change 0.16, IQR 0.09-0.43). Another 
strong decrease over time is found for a signature of unknown aetiol- 
ogy, SBS12, which acts mostly in liver cancers (median fold change 0.22, 
IQR 0.06-0.41). In chronic lymphoid leukaemia, there was a 20-fold 
relative decrease in mutations associated with somatic hypermuta- 
tion (SBS9; median fold change 0.05, IQR 0.02-0.43) from clonal to 
subclonal stages. 

Some mutational processes tend to increase throughout cancer 
evolution. For example, we see that APOBEC mutagenesis (SBS2 and 
SBS13) increases in many cancer types from the early to late clonal 
stages (median fold change 2.0, IQR 0.8-3.6), as does anewly described 
signature SBS38 (median fold 3.6, IQR 1.8-11). Signatures of defective 
mismatch repair (SBS6, 14, 15, 20, 21, 26 and 44) increase from clonal 
to subclonal stages (median fold 1.8, IQR 1.2-3.0). 


Chronological time estimates 


The molecular timing data presented above do not measure the 
occurrence of events in chronological time. If the rate at which 
mutations are acquired per year in each sample was constant, the 
chronological time would simply be the product of the estimated 
molecular timing and age at diagnosis. However, this relation will be 
nonlinear if the mutation rate changes over time, and is inflated by 
acquired mutational processes, as suggested by the analysis in the 
previous section. Some of these issues can be mitigated by count- 
ing only mutations contributed by endogenous and less variable 
mutational processes, such as CpG-to-TpG mutations (hereafter 
CpG>TpG) caused by spontaneous deamination of 5-methyl-cytosine 
to thymine at CpG dinucleotides, which have been proposed as 
a molecular clock”. Our supplementary analysis suggests that, 
although the baseline CpG>TpG mutation rate in cancers is very 
close to that in normal cells, there appears to be a moderate increase 
(1-10 times, adding between 20 and 40% of mutations) in cancers 
(Extended Data Fig. 8). As this shifts chronological timing estimates, 
we model different scenarios of the evolution of the CpoG>TpG muta- 
tion rate (Fig. 5a). 

Applying this logic to time WGDs, which yield sufficient numbers of 
CpG>TpG mutations, demonstrates that they occur several years and 
possibly evena decade or more before diagnosis in some cancer types, 
under arange of scenarios of mutation rate increase (Fig. 5b, Extended 
Data Fig. 9). A notable example is ovarian adenocarcinoma, which 
appears to have a median latency of more than 10 years. This holds 
true even under ascenario of aCpG>TpG rate increase of 20-fold, which 
would be far beyond the 7.5-fold rate increase observed in matched 
primary and relapse samples” (Extended Data Fig. 8f). Notably, these 
results suggest WGD may occur throughout the entire female repro- 
ductive life (Extended Data Fig. 9b). The latency between the MRCA 
and the last detectable subclone is shorter, typically several months 
to years (Fig. 5c). 

These timescales of cancer evolution are further supported by the 
fact that progression of most known precancerous lesions to carcino- 
mas usually spans many years, if not decades*° *. Our data corroborate 
these timescales and extend them to cancer types without detectable 
premalignant conditions, raising the hope that these tumours could 
also be detected in less malignant stages. 


a b 
SA520037, 72 years old, Ovarian-AdenoCA 40 

411 CpG>TpG mutations 1x 
WGD at 29% molecular time 


—30 
Rate § 2.5x CpG>TpG 
8 400 increase 2 rate 
2 MRCA 320 5x increase 
& 300 dog 2 
i=} g 7 ox 
& 200 S10 gL oflll 20x 
ao 20x 
e 
100 age a 4 ! 
0} 0 
& o SBS SE EERE % SER CLS = Ber 
CRE eo BO NEY Drow or ee »; 
010 30 50 70 SEES Sees GEOR Re SEES 


e 
POELN OO ENE owe 
Age ee ae eR SEL 
3 


2 | onglal 


SESSILIS: SIS SOO SESE. SO 


SPM SS iS 
SBE ARE % EME IEEE IK Sy SX OK 
x I) aS Lees & 2 x a 
FNS os S SEE ey oss “e Pa 
g go S 


Fig. 5 | Approximate chronological timing inference suggests a timescale of 
cancer evolution of several years. a, Mapping of molecular timing estimates to 
chronological time under different scenarios of increases in the CpG>TpG 
mutation rate. A greater increase before diagnosis indicates an inflation of the 
mutation timescale. b, Median latency between WGDs and the last detectable 
subclone before diagnosis under different scenarios of CpG>TpG mutation rate 
increases for n=569 non-hypermutant cancers with at least 100 informative 
SNVs, low tumour in normal contamination and at least five samples per 
tumour histology. c, Median latency between the MRCA and the last detectable 
subclone before diagnosis for different CpG>TpG mutation rate changes in 
n=1,921non-hypermutant samples with low tumour in normal contamination 
and at least 5 cases per cancer type. 


Discussion 


To our knowledge, our study presents the first large-scale genome-wide 
reconstruction of the evolutionary history of cancers, reconstructing 
both early (pre-cancer) and later stages of 38 cancer types. This is facili- 
tated by the timing of copy number gains relative to all other events 
inthe genome, through multiplicity and clonal status of co-amplified 
point mutations. However, several limitations exist (Supplementary 
Information). Perhaps most importantly, molecular timing is based 
on point mutations and is therefore subject to changes in mutation 
rate. Notably, healthy tissues acquire point mutations at rates not too 
dissimilar from those seen in cancers, particularly when considering 
only endogenous mutational processes, and furthermore, some tissues 
are riddled with microscopic clonal expansions of driver gene 
mutations* °”. This is direct evidence that the life history of almost 
every cell in the human body, including those that develop into cancer, 
is driven by somatic evolution. 

Together, the data presented here enable us to draw approximate 
timelines summarizing the typical evolutionary history of each cancer 
type (Fig. 6, Supplementary Information for all other cancer types). 
These make use of the qualitative timing of point mutations and copy 
number alterations, as well as signature activities, which can be inter- 
leaved with the chronological estimates of WGD and the appearance 
of the MRCA. 

Itis remarkable that the evolution of practically all cancers displays 
some level of order, which agrees very well with, and adds much detail 
to, established models of cancer progression®“*. For example, TP53 
with accompanying 17p deletion is one of the most frequent initiating 
mutations ina variety of cancers, including ovarian cancer, in whichit 
is the hallmark of its precancerous precursor lesions”. Furthermore, 
the list of typically early drivers includes most other highly recurrent 
cancer genes, suchas KRAS, TERT and CDKN2ZA, indicating a preferred 
role in early and possibly even pre-cancer evolution. This initially con- 
strained set of genes broadens at later stages of cancer development, 
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Fig. 6 | Typical timelines of tumour development. a-d, Timelines 
representing the length of time, in years, between the fertilized egg and the 
median age of diagnosis for colorectal adenocarcinoma (a), squamous cell lung 
cancer (b), ovarian adenocarcinoma (c) and pancreatic adenocarcinoma (d). 
Real-time estimates for major events, suchas WGD and the emergence of the 
MRCA, are used to define early, variable, late and subclonal stages of tumour 
evolution approximately in chronological time. The range of chronological 
time estimates according to varying clock mutation acceleration rates is shown 
as well, with tick marks corresponding to 1x, 2.5x, 5x, 7.5x, 10x and 20x. Driver 
mutations and copy number alterations (CNA) are shown in each stage 
according to their preferential timing, as defined by relative ordering. 
Mutational signatures (Sigs) that, on average, change over the course of tumour 
evolution, or are substantially active but not changing, are shownin the epochin 
which their activity is greatest. DBS, double base substitution; SBS, single base 
substitutions. Where applicable, lesions witha known timing from the literature 
are annotated; dagger symbols denotes events that were found to havea 
different timing; asterisk symbol denotes events that agree with our timing. 


suggesting an epistatic fitness landscape canalizing the first steps of 
cancer evolution. Over time, as tumours evolve, they follow increasingly 
diverse paths driven by individually rare driver mutations, and by copy 
number alternations. However, none of these trends is absolute, and 
the evolutionary paths of individual tumours are highly variable, show- 
ing that cancer evolution follows trends, but is far from deterministic. 

Our study sheds light on the typical timescales of in vivo tumour 
development, with initial driver events seemingly occurring up to 
decades before diagnosis, demonstrating how cancer genomes are 
shaped by a lifelong process of somatic evolution, with fluid bound- 
aries between normal ageing processes ™ and cancer evolution. 
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Nevertheless, the presence of genetic aberrations with such long 
latency raises hopes that aberrant clones could be detected early, 
before reaching their full malignant potential. 
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Methods 


Dataset 

The PCAWG series consists of 2,778 tumour samples (2,703 white listed, 
75 grey listed) from 2,658 donors. Allsamples inthis dataset underwent 
whole-genome sequencing (minimum average coverage 30x in the 
tumour, 25x in the matched normal samples), and were processed 
with a set of project-specific pipelines for alignment, variant calling, 
and quality control*. Copy number calls were established by combin- 
ing the output of six individual callers into a consensus using a multi- 
tier approach, resulting in a copy number profile, a purity and ploidy 
value and whether the tumour has undergone a WGD (Supplementary 
Information). Consensus subclonal architectures have been obtained 
by integrating the output of 11 subclonal reconstruction callers, after 
which all SNVs, indels and structural variants are assigned to a muta- 
tion cluster using the MutationTimer.R approach (Supplementary 
Information). Driver calls have been defined by the PCAWG Driver 
Working Group*, and mutational signatures are defined by the PCAWG 
Signatures Working Group™. A more detailed description can be found 
in Supplementary Information, section 1. 

Data accrual was based on sequencing experiments performed by 
individual member groups of the ICGC and TCGA, as described in an 
associated study’. As this is a meta-analysis of existing data, power 
calculations were not performed and the investigators were not blinded 
to cancer diagnoses. 


Timing of gains 
We used three related approaches to calculate the timing of copy num- 
ber gains (see Supplementary Information, section 2). In brief, the 
common feature is that the expected VAF of a mutation (F) is related to 
the underlying number of alleles carrying a mutation according tothe 
formula: E[X] =nmfp/[N (1- p) + Cp], inwhich Xis the number of reads, 
n denotes the coverage of the locus, the mutation copy number mis 
the number ofalleles carrying the mutation (which is usually inferred), 
fis the frequency of the clone carrying the given mutation (f= 1 for 
clonal mutations). Nis the normal copy number (2 on autosomes, 1 or 
2 for chromosome X and 0 or 1 for chromosome Y), Cis the total copy 
number of the tumour, and pis the purity of the sample. 

The number of mutations n,,at each allelic copy number m then 
informs about the time when the gain has occurred. The basic formulae 
for timing each gain are, depending onthe copy number configuration: 


Copy number 2 +1: 7=3n,/(2n, + n,) 
Copy number 2+ 2: T=2n,/(2n, + n,) 


Copy number 2+ 0: T=2n,/(2n, + n,) 


in which 2+ 1refers to major and minor copy number of 2 and 1, respec- 
tively. Methods differ slightly in how the number of mutations present 
oneachallele are calculated and how uncertainty is handled (Supple- 
mentary Information). 


Timing of mutations 

The mutation copy number mand the clonal frequency fis calculated 
according to the principles indicated above. Details can be foundin Sup- 
plementary Information, section 2. Mutations with f= 1 are denoted 
as ‘clonal’, and mutations with f< 1 as ‘subclonal’. Mutations with f=1 
and m>1are denoted as ‘early clonal’ (co-amplified). In cases with f=1, 
m=\1and C>2, mutations were annotated as ‘late clonal’, if the minor 
copy number was 0, otherwise ‘clonal’ (unspecified). 


Timing of driver mutations 
Acatalogue of driver point mutations (SNVs and indels) was provided 
by the PCAWG Drivers and Functional Interpretation Group*. The timing 


category was calculated as above. From the four timing categories, the 
odds ratios of early/late clonal and clonal (early, late or unspecified 
clonal)/subclonal were calculated for driver mutations against the 
distribution of all other mutations present in fragments with the same 
copy number composition in the samples with each particular driver. 
The background distribution of these odds ratios was assessed with 
1,000 bootstraps (Supplementary Information, section 4.1). 


Integrative timing 

For each pair of driver point mutations and recurrent copy number 
alterations, an ordering was established (earlier, later or unspecified). 
The information underlying this decision was derived from the timing 
of each driver point mutation, as well as from the timing status of clonal 
and subclonal copy number segments. These tables were aggregated 
across all samples anda sports statistics model was employed to calcu- 
late the overall ranking of driver mutations. A full description is given 
in Supplementary Information, section 4.2. 


Timing of mutational signatures 

Mutational trinucleotide substitution signatures, as defined by the 
PCAWG Mutational Signatures Working Group”, were fit to samples 
with observed signature activity, after splitting point mutations into 
either of the four epochs. A likelihood ratio test based on the multi- 
nomial distribution was used to test for differences in the mutation 
spectra between time points. Time-resolved exposures were calcu- 
lated using non-negative linear least squares. Full details are given in 
Supplementary Information, section 5. 


Real-time estimation of WGD and MRCA 

CpG>TpG mutations were counted in an NpCpG context, except for 
skin-melanoma, in which CpCpG and TpCpG were excluded owing to 
the overlapping UV mutation spectrum. For visual comparison, the 
number of mutations was scaled to the effective genome size, defined as 
the 1/mean(m,/C,), in which m,is the estimated number of allelic copies 
of each mutation, and C,is the total copy number at that locus, thereby 
scaling to the final copy number and the time of change. 

A hierarchical Bayesian linear regression was fit to relate the age at 
diagnosis to the scaled number of mutations, ensuring positive slope 
and intercept through a shared gamma distribution across cancer types. 

For tumours with several time points, the set of mutations shared 
between diagnosis and relapse (np) and those specific to the relapse (n,) 
was calculated. The rate acceleration was calculated as: a= Np/Np  ty/te. 
This analysis was performed separately for all substitutions and for 
CpG>TpG mutations. 

Onthe basis of these analyses, a typical increase of 5x for most cancer 
types was chosen, with a lower value of 2.5x for brain cancers anda 
value of 7.5x for ovarian cancer. 

The correction for transforming an estimate of acopy number gain 
in mutation time into chronological time depends not only onthe rate 
acceleration, but also on the time at which this acceleration occurred. 
As this is generally unknown, we performed Monte Carlo simulations 
of rate accelerations spanning an interval of 15 years before diagnosis, 
corresponding roughly to 25% of time for a diagnosis at 60 years of age, 
noting that a 5x rate increase over this duration yields an offset of about 
33% of mutations, compatible with our data. Subclonal mutations were 
assumed to occur at full acceleration. The proportion of subclonal 
mutations was divided by the number of identified subclones, thus 
conservatively assuming branching evolution. Full details are given 
in Supplementary Information, section 6. 


Cancer timelines 

The results from each of the different timing analyses are combined in 
timelines of cancer evolution for each tumour type (Fig. 6 and Supple- 
mentary Information). Each timeline begins at the fertilized egg, and 
spans up to the median age of diagnosis within each cohort. Real-time 
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estimates for WGD and the MRCA act as anchor points, allowing us 
to roughly map the four broadly defined time periods (early clonal, 
intermediate, late clonal and subclonal) to chronological time during a 
patient’s lifespan. Specific driver mutations or copy number alterations 
canbe placed within each of these time frames based on their ordering 
from the league model analysis. Signatures are shown if they typically 
change over time (95% confidence intervals of mean change not over- 
lapping 0), and if they are strongly active (contributing at least 10% 
mutations to onetime point). Signatures are shown on the timelinein 
the epoch of their greatest activity. Where an event found in our study 
has a known timing in the literature, the agreement is annotated on 
the timeline; with an asterisk denoting an agreed timing, and dagger 
symbol denoting a timing that is different to our results. Full details 
are given in Supplementary Information, section 7. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Somatic and germline variant calls, mutational signatures, subclonal 
reconstructions, transcript abundance, splice calls and other core 
data generated by the ICGC/TCGA PCAWG Consortium are described 
elsewhere‘ and available for download at https://dcc.icgc.org/releases/ 
PCAWG. Further information on accessing the data, including raw read 
files, can be found at https://docs.icgc.org/pcawg/data/. In accord- 
ance with the data access policies of the ICGC and TCGA projects, most 
molecular, clinical and specimen data are in an open tier that does 
not require access approval. To access information that could poten- 
tially identify participants, such as germline alleles and underlying 
sequencing data, researchers will need to apply to the TCGA Data Access 
Committee (DAC) via dbGaP (https://dbgap.ncbi.nlm.nih.gov/aa/wga. 
cgi?page=login) for access to the TCGA portion of the dataset, and to 
the ICGC Data Access Compliance Office (DACO; http://icgc.org/daco) 
for the ICGC portion. In addition, to access somatic SNVs derived from 
TCGA donors, researchers will also need to obtain dbGaP authorization. 
Datasets used and results presented in this study, including timing 
estimates for copy number gains, chronological estimates of WGD and 
MRCA, as well as mutation signature changes, are described in Sup- 
plementary Note 3 and are available at https://dcc.icgc.org/releases/ 
PCAWG/evolution-heterogeneity. 


Code availability 


The core computational pipelines used by the PCAWG Consortium for 
alignment, quality control and variant calling are available to the public 
at https://dockstore.org/search?search=pcawg under the GNU General 
Public License v3.0, which allows for reuse and distribution. Analysis 
code presented in this study is available through the GitHub reposi- 
tory https://github.com/PCAWG-11/Evolution. This archive contains 


relevant software and analysis workflows as submodules, which include 
code for timing copy number gains, point mutations and mutation 
signatures, real-time timing and evolutionary league model analysis, 
as well as scripts to generate the figures presented: CancerTiming 
(v.3.1.8), MutationTimeR (v.0.1), PhylogicNDT (v.1.1) and a series of 
custom scripts (v. 1.0), with detailed versions of other packages used. 
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Extended Data Fig. 1| Summary ofall results obtained for colorectal clonal and late clonal stages, per patient. Greenand purple indicate, 
adenocarcinoma (n= 60) as anexample.a, Clustered heat mapsofmutational —_ respectively, a signature decrease and increase in late clonal from early clonal 
timing estimates for gained segments, per patient. Colours as indicated in mutations. Inactive signatures are coloured white. e, As ind but for clonal 
main text: green represents early clonal events, purple represents late clonal. versus subclonal stages. Blue indicates a signature decrease and red an 
b, Relative ordering of copy number events and driver mutations across all increase in subclonal from clonal mutations. f, Typical timeline of tumour 
samples.c, Distribution of mutations across early clonal, late clonal and development. Similar result summaries for all other cancer types can be found 
subclonal stages, for the most common driver genes. A maximum of 10 driver inthe Supplementary Information (pages 46-77). 


genes are shown. d, Clustered mutational signature fold changes between early 


Article 


b PhylogicNDT Timing with subclonal vs MutationTimeR Cc Comparison against true timing 


= PhylogicNDT Timing, m=0.968 
=e CancerTiming, m=0.991 
=a MutationTimeR, m=1.059 


a PhylogicNDT Timing vs cancerTiming only clonal no 2/2 
1.0 


08 08 08 
© 
E 
is 
3s 
206 206 38 
€ E 2 
E 2 
iz § = 
8 Fs 3 
Ey w 04 
8 04 = 04 £ 
d 
0.2 
02 02 
0.0 
0.0 o9 
0.0 0.2 04 06 08 1.0 00 02 04 06 0s 1.0 0.0 02 04 06 08 10 
PhylogicNDT Timing PhylogicNDT Timing with subclonal True Molecular Time 


Extended Data Fig. 2| Comparison of methods used for timing of individual copy number gains. a, b, Pairwise comparison of the three approaches for timing 
individual copy number gains. c, Comparison using simulated data, showing high concordance. 
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Extended Data Fig. 3| Early copy number gains in brain cancers. a, Three n=34GBMsamples with trisomy 7.c, Medulloblastoma example with 
illustrative examples of glioblastoma with trisomy 7. The red arrow depicts the isochromosome 17q. d, Distributions of SNVs on 17q inn=95 samples with 
expected VAF cluster of point mutations preceding trisomy 7, which usually isochromosome 17q; 74 out of 95 samples have less than1SNV preceding the 
contains less than three SNVs. b, Distributions of the number of SNVs isochromosome. 
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Extended Data Fig. 4| See next page for caption. 
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Extended Data Fig. 4 | Validation of relative ordering model reconstruction 
based on simulated cohorts of whole-genome samples. a, Relative ordering 
model (PhylogicNDT LeagueModel) results for a simulated cohort of samples 
(n=100) froma single generalized relative order of events (with varied 
prevalence) showing high concordance with the true trajectory. Probability 
distributions show the uncertainty of timing for specific events in the cohort. 
b, Relative ordering model results ona simulated cohort of samples (n=95) 
from acomplex mixture of trajectories with different order of events showing 
high concordance with the expected average trajectory. c, Estimation of 
accuracy of the relative ordering model reconstruction by simulation ofa 

set of 100 cohorts (n(samples) = 100) with random trajectory mixtures and 
quantifying the distance in log odds early/late from perfect ordering. For the 
vast majority of events (even with low number of occurrences in the cohort), 


the log odds error does not exceed 1, confirming that very few events would 
switch between timing categories. The inset box corresponds tothe first and 
third quartiles of the distribution, the horizontal line indicates the median and 
whiskers include data within 1.5x the IQR from the box. d, Simulated data show 
concordant timing in cohorts with WGD (n= 245). Exclusion of samples with 
WGD (right, n= 242) introduces only a mild drop in accuracy, indicating that 
WGD is beneficial but not necessary for the reconstruction. Red dot =true rank. 
e, Estimated log odds in observed data including WGD (left, n= 245) and 
without (right, n = 242), across different mutation types. The inset box 
corresponds tothe first and third quartiles of the distribution, the horizontal 
line indicates the median and whiskers include data within 1.5x the IQR from 
the box. 
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Extended Data Fig. 5| Correlation between the league model and Bradley- 
Terry model ordering. Direct comparison for each tumour type of the league 
and Bradley-Terry models for determining the order of recurrent somatic 


mutations and copy number events. Axes indicate the ordered events observed 
inthe respective tumour types. Correlation is quantified by Spearman’s rank 
correlation coefficient. A total of n= 756 ordered events are shown. 
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b Examples: clonal to subclonal mutation spectrum changes 
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Extended Data Fig. 6| Examples of mutation spectrum changes across 
tumour evolution. a, Three examples of tumours with substantial changes 
between mutation spectra of early (top) and late (bottom) clonal time points. 
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b, Three examples of tumours with substantial changes between mutation 
spectra of clonal (top) and subclonal (bottom) time points. 
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Extended Data Fig. 7| Overview of early-to-late clonal and clonal-to- 
subclonal signature changes across tumour types. a, b, Pie charts 
representing signature changes per cancer type for early-to-late clonal 
signature changes (a) and clonal-to-subclonal signature changes (b). 
Signatures that decrease between early and late are coloured green; signatures 
that increase are purple. The size of each pie chart represents the frequency of 


each signature. Signatures are split into three categories: (1) clock-like, 
comprising the putative clock signatures land 5; (2) frequent, which are 
signatures present inten or more cancer types; and (3) cancer-type specific, 
which are in fewer than ten cancer types and are often limited to specific 
cohorts. 
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Extended Data Fig. 8 | See next page for caption. 
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Extended Data Fig. 8 | Age-dependent mutation burden and relapse samples 
indicate near-normal CpG>TpG mutation rate in cancer, with moderate 
acceleration during carcinogenesis. a, Across all cancer samples, a 
predominantly linear accumulation of CpG>TpG mutations (scaled to copy 
number) is observed over time, as measured by the age at diagnosis. b, Cancer- 
specific analysis of the CpG>TpG mutation burden asa function of age at 
diagnosis for n=1,978 samples of 34 informative cancer types. The dotted line 
denotes the median mutations per year (thatis, not offset), and shading 
denotes the 95% credible interval of a hierarchical Bayesian linear regression 
modelacross all data points. Slope and intercepts are drawn for eachcancer 
type fromagamma distribution, respectively; inference was done by 
Hamiltonian Monte Carlo sampling. c, Maximum a posteriori estimates of 
rate and offset for 34 cancer types with 95% credible intervals as defined inb. 


d, Mutation rate inferred from cancer as inb and from selected normal tissue 
sequencing studies of n=140 normal haematopoietic stem cells, n=1normal 
skin sample, n=182 samples from normal endometrium, and n=445 normal 
colonic crypts; error bars denote the 95% confidence interval.e, Median 
fraction of mutations attributed to linear age-dependent accumulation, based 
onestimates from band the age at diagnosis for each sample. Error bars denote 
the 95% credible interval. f, g, CoG>TpG mutations per gigabase for ovarian 
cancer (f) and breast cancer (g) samples with matched primary and relapse 
samples. h, Increase in CpG>TpG mutation rate inferred from paired primary 
and relapse samples for six cancer types. Bars denote the range of the rate 
increase for different scenarios of copy number evolution, assuming ploidy 
changes have occurred prior (upper value) or posterior (lower value) to the 
branching between primary and relapse sample. 
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Extended Data Fig. 9 | Real-time estimates indicate long latencies for some 
samples caused by the absence of early mutations. a, Time of WGD for n=571 
individual patients, split by tumour type with an estimated mutation rate 
increase of 5x, except for ovary-adenocarcinoma (7.5x) and CNS (2.5x). Error 
bars represent 80% confidence intervals, reflecting uncertainty stemming 
fromthe number of mutations per segment and onset of the rate increase. Box 
plots demarcate the quartiles and median of the distribution with whiskers 
indicating 5% and 95% quantiles. b, Scatter plots showing the time of diagnosis 
(x axis) and inferred time of WGD (yaxis) with error bars as ina.c, Scatter plot of 
early (co-amplified) CpG>TpG mutations (y axis) as a function of the mutational 
time estimate of WGD (xaxis). The black line denotes a nonlinear loess fit with 
95% confidence interval. Colours define the cancer typeasina.d, Total 


CpG>TpG mutations (yaxis) as a function of the mutation time estimate of 
WGD (xaxis). Colours and fit as inc. Early molecular timing is thus caused bya 
depletion of early CpG>TpG mutations, rather than an inflation of late 
CpG>TpG mutations. e, Estimated median WGD latency of n=571 patients asin 
afor fixed (x axis) versus patient specific rate increases, depending onthe 
observed CpG>TpG mutation burden, allowing for a higher (up to 10x) 
mutation rate increase insamples with more mutations (y axis). Error bars 
denote the IQR. f, Timing of subclonal diversification using CpG>TpG 
mutations in n=1,953 individual patients. Box plots and error bars for data 
points asina.g, Comparison of the median duration of subclonal 
diversification per cancer type assuming branching and linear phylogenies. 
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n/a | Confirmed 


x| The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


x A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


[x | A description of all covariates tested 


x A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


x] A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
i AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


[x] For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


x For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


x For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


x Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Data and metadata were collected from International Cancer Genome Consortium (ICGC) consortium members using custom software 
packages designed by the ICGC Data Coordinating Centre. The general-purpose core libraries and utilities underlying this software have 
been released under the GPLv3 open source license as the "Overture" package and are available at https://www.overture.bio. Other data 
collection software used in this effort, such as |CGC-specific portal user interfaces, are available upon request to contact@overture.bio. 


Data analysis The PCAWG workflows executing core WGS alignment, QC and variant-calling software are packaged as executable Dockstore images and 
available at: https://dockstore.org/search?labels.value.keyword=pcawg&searchModez=files. Individual software components are as 
follows: BWA-MEM v0.78.8-r455; DELLY v0.6.6; ACEseq v1.0.189; DKFZ somatic SNV workflow v1.0.132-1; Platypus v0.7.4; ascatNgs 
v1.5.2; BRASS v4.012; grass v1.1.6; CaVEMan v1.50; Pindel v1.5.7; ABSOLUTE/JaBbA v1.5; SVABA 2015-05-20; dRanger 2016-03-13; 
BreakPointer 2015-12-22; MuTect v1.1.4; MuSE v1.0rc; SMuFIN 2014-10-26; OxoG 2016-4-28; VAGrENT v2.1.2; ANNOVAR v2014Nov12; 
VariantBAM v2017Dec12; SNV-Merge v2017May26; SV-MERGE v2017Dec12; DKFZ v2016Dec15. 

Analysis code presented in this study is available through the github repository https://github.com/PCAWG-11/Evolution. This archive 
contains relevant software and analysis workflows as submodules, including code for timing copy number gains, point mutations and 
mutation signatures, real-time timing, and evolutionary league model analysis, as well as scripts to generate the figures presented. 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


WGS somatic and germline variant calls, mutational signatures, subclonal reconstructions, transcript abundance, splice calls and other core data generated by the 
ICGC/TCGA Pan-cancer Analysis of Whole Genomes Consortium are available for download at https://dcc.icgc.org/releases/PCAWG. Additional information on 
accessing the data, including raw read files, can be found at https://docs.icgc.org/pcawg/data/. In accordance with the data access policies of the ICGC and TCGA 
projects, most molecular, clinical and specimen data are in an open tier which does not require access approval. To access potentially identification information, 
such as germline alleles and underlying sequencing data, researchers will need to apply to the TCGA Data Access Committee (DAC) via dbGaP (https:// 
dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?page=login) for access to the TCGA portion of the dataset, and to the ICGC Data Access Compliance Office (DACO; http:// 
icgc.org/daco) for the ICGC portion. In addition, to access somatic single nucleotide variants derived from TCGA donors, researchers will also need to obtain dbGaP 
authorization. All results presented in this study, including timing estimates for copy number gains, real time estimates of WGD and MRCA, as well as mutation 
signature activities, are available at https://www.synapse.org/#!Synapse:syn14193595. 


Field-specific reporting 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size We compiled an inventory of matched tumour/normal whole cancer genomes in the ICGC Data Coordinating Centre. Most samples came from 
treatment-naive, primary cancers, but there were a small number of donors with multiple samples of primary, metastatic and/or recurrent 
tumours. Our inclusion criteria were: (i) matched tumour and normal specimen pair; (ii) a minimal set of clinical fields; and (iii) 
characterisation of tumour and normal whole genomes using Illumina HiSeq paired-end sequencing reads. We collected genome data from 
2,834 donors, representing all ICGC and TCGA donors that met these criteria at the time of the final data freeze in autumn 2014. 


Data exclusions | After quality assurance, data from 176 donors were excluded as unusable. Reasons for data exclusions included inadequate coverage, 
extreme bias in coverage across the genome, evidence for contamination in samples and excessive sequencing errors (for example, through 8- 


oxoguanine). Hypermutated and samples with normal contamination were excluded for chronological inferences in this study, as described in 
the Supplementary Methods. 


Replication In order to evaluate the performance of each of the mutation-calling pipelines and determine an integration strategy, we performed a large- 
scale deep sequencing validation experiment. We selected a pilot set of 63 representative tumour/normal pairs, on which we ran the three 
core pipelines, together with a set of 10 additional somatic variant-calling pipelines contributed by members of the SNV Calling Working 
Group. Overall, the sensitivity and precision of the consensus somatic variant calls were 95% (CI90%: 88-98%) and 95% (CI90%: 71-99%) 
respectively for SNVs. For somatic indels, sensitivity and precision were 60% (34-72%) and 91% (73-96%) respectively. Regarding SVs, we 
estimate the sensitivity of the merging algorithm to be 90% for true calls generated by any one caller; precision was estimated as 97.5% - that 
is, 97.5% of SVs in the merged SV call-set have an associated copy number change or balanced partner rearrangement. 

The accuracy of inferences in this study was assessed using simulations and by applying three different algorithms for the timing of copy 


number gains (Extended Data Figure 2), as well as two different algorithms for the temporal ordering of driver mutations (Extended Data 
Figure 5). 


Randomization —_N/A- This exploratory study did not contain a randomization step 


Blinding N/A - This exploratory study did not contain a blinded analysis 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 
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Human research participants 


Policy information about studies involving human research participants 


Population characteristics 


Recruitment 


Ethics oversight 


Patient-by-patient clinical data are provided in Extended Data Table 1 of the marker paper for the PCAWG consortium. 
Demographically, the cohort included 1,469 males (55%) and 1,189 females (45%), with a mean age of 56 years (range, 1-90 
years). Using population ancestry-differentiated single nucleotide polymorphisms (SNPs), the ancestry distribution was heavily 
weighted towards donors of European descent (77% of total) followed by East Asians (16%), as expected for large contributions 
from European, North American and Australian projects. We consolidated histopathology descriptions of the tumour samples, 
using the ICD-0-3 tumour site controlled vocabulary. Overall, the PCAWG data set comprises 38 distinct tumour types. While the 
most common tumour types are included in the dataset, their distribution does not match the relative population incidences, 
largely due to differences among contributing ICGC/TCGA groups in numbers sequenced. 


Patients were recruited by the participating centres following local protocols. 


The Ethics oversight for the PCAWG protocol was undertaken by the TCGA Program Office and the Ethics and Governance 
Committee of the ICGC. Each individual ICGC and TCGA project that contributed data to PCAWG had their own local 
arrangements for ethics oversight and regulatory alignment. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Transcript alterations often result from somatic changes in cancer genomes’. Various 
forms of RNA alterations have been described in cancer, including overexpression’, 
altered splicing’ and gene fusions‘; however, it is difficult to attribute these to 
underlying genomic changes owing to heterogeneity among patients and tumour 
types, and the relatively small cohorts of patients for whom samples have been 
analysed by both transcriptome and whole-genome sequencing. Here we present, to 
our knowledge, the most comprehensive catalogue of cancer-associated gene 
alterations to date, obtained by characterizing tumour transcriptomes from1,188 
donors of the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium of the 
International Cancer Genome Consortium (ICGC) and The Cancer Genome Atlas 
(TCGA)°. Using matched whole-genome sequencing data, we associated several 
categories of RNA alterations with germline and somatic DNA alterations, and 
identified probable genetic mechanisms. Somatic copy-number alterations were the 
major drivers of variations in total gene and allele-specific expression. We identified 
649 associations of somatic single-nucleotide variants with gene expression in cis, of 
which 68.4% involved associations with flanking non-coding regions of the gene. We 
found 1,900 splicing alterations associated with somatic mutations, including the 
formation of exons within introns in proximity to Alu elements. In addition, 82% of 
gene fusions were associated with structural variants, including 75 of anew class, 
termed ‘bridged’ fusions, in which a third genomic location bridges two genes. We 
observed transcriptomic alteration signatures that differ between cancer types and 
have associations with variations in DNA mutational signatures. This compendium of 
RNA alterations in the genomic context provides a rich resource for identifying genes 
and mechanisms that are functionally implicated in cancer. 


For amore extensive study of cancer genome alterations, particularly 
in non-coding regions, the PCAWG project was formed to analyse 
the large number of whole-genome samples that were contributed 
to the ICGC and TCGA projects>. Individual projects did not use the 
same methods for key analyses; therefore, a major focus for each of 
the 16 PCAWG Working Groups was the unified analysis of the PCAWG 
data. For example, the PCAWG Technical Working Group led raw 
data collection, realignment of whole-genome sequencing data and 
implemented core somatic mutation calling pipelines’. Other 
PCAWG working groups focused on unified analyses of copy- 
number variation®, structural variants’’, germline variants>, 


mutational signatures’ and identification of driver genes®, among 
others®. Here, we report the joint analysis of available matched tran- 
scriptome and genome profiling for 1,188 samples from 27 tumour 
types by the PCAWG Transcriptome Working Group’, providing the 
largest, to our knowledge, resource of RNA phenotypes and their 
underlying genetic changes in cancer so far (Extended Data Fig. 1, 
Methods, Supplementary Results, Supplementary Table 23). We dem- 
onstrate the importance of transcriptomics data in understanding 
how different dimensions of specific DNA alterations contribute to 
carcinogenesis and map out the landscape of cancer-related RNA 
alterations. 


A list of affiliations appears at the end of the paper. 
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Cancer-specific germline cis-eQTLs 


To investigate the underlying mechanisms of different types of RNA 
alteration, we first focused on changes in the gene expression level 
(Extended Data Fig. 2). We initially considered common germline 
variants (minor allele frequency = 1%) proximal to individual genes 
(+100 kb), and mapped expression quantitative trait loci (eQTL) 
across the cohort (Extended Data Fig. 3, Supplementary Table 1). This 
pan-cancer analysis identified 3,532 genes with an eQTL (false dis- 
covery rate (FDR) < 5%, hereafter denoted eGenes) (Supplementary 
Table 2), enriched in proximal regions of transcription start sites (TSSs) 
(Extended Data Fig. 3). 

To identify cancer-specific regulatory variants, we compared our 
eQTLs to eQTLs from the Genotype-Tissue Expression (GTEx) pro- 
ject’, adopting previous strategies to assess eQTL replication”, and 
probed lead eQTL variants for marginal significance in GTEx tissues 
(P< 0.01, Bonferroni-adjusted). Although most lead variants could 
be detected in GTEx samples (3,110 out of 3,532 eQTL variants), we 
identified 422 eQTLs that did not correspond to GTEx tissues, which 
suggests cancer-specific regulation (Extended Data Fig. 4, Supplemen- 
tary Table 3). The corresponding eQTL lead variants were enriched 
for heterochromatic regions (Fig. 1a). Overall, this analysis revealed 
that the germline framework of gene expression regulation is largely 
conserved in cancer tissues. 


Somatic cis-eQTLs in non-coding regions 

Previous studies have described the landscape of non-coding mutations 
in cancer’, particularly in promoter regions, and also their regulatory 
effects on gene expression”. Here, we looked at possible somatic 
DNA changes, across the whole genome, that underlie alterations in 
gene expression. We estimated local mutation burdens by aggregating 
single-nucleotide variants (SNVs) in 2-kb intervals adjacent to genes 
(flanking), as well as in exons and introns (Extended Data Figs. 2, 5, 6). 
Next, we decomposed the expression variation of individual genes, 
considering common mutation burdens in cis, as well as cis germline 
variants and somatic copy-number alterations (SCNAs). This identified 
SCNAs as the major driver of expression variation (17%), followed by 
somatic SNVs in gene flanking regions (1.8%) and germline variants 
(1.3%) (Fig. 1b). 

We also tested for associations between all common mutation bur- 
dens and gene expression across the whole genome. We identified 
649 genes with a somatic eQTL (FDR < 5%) (Supplementary Table 5). 
Of these, 11 associations were located in introns or exons of the respec- 
tive eGene, including genes with known roles in the pathogenesis of 
specific cancers such as CDK12in ovarian cancer" and /JRF4 in chronic 
lymphocytic leukaemia® (Extended Data Figs. 7, 8). Most eQTLs (68.4%) 
involved associations with flanking non-coding mutation burdens 
(Extended Data Fig. 6e). Next, we considered eQTLs in flanking regions 
(n= 556) and tested for enrichment in cell-type-specific annotations 
from the Epigenetics Roadmap”. This identified 13 enriched anno- 
tations (FDR < 10%) (Extended Data Fig. 9, Supplementary Table 6), 
including poised promoters, weak and active enhancers, and hetero- 
chromatin, but notably no enrichment for transcription-factor-binding 
sites (Supplementary Table 7). This enrichment in transcriptionally 
inactive regions may be due to an increased mutation rate in these 
regions (Extended Data Fig. 9), which has previously been reported 
incancer”. 

We also looked at the functional characterization of somatic eGenes 
and observed an enrichment for somatic eQTLs in bivalent promoters 
for cancer testis genes (P= 0.04, Fisher’s exact test) such as TEKTS'® 
(Fig. 1c, Extended Data Fig. 8h). Furthermore, we found a global enrich- 
ment (FDR < 10%) for Gene Ontology (GO) categories related to cell 
differentiation and developmental processes (Supplementary Table 8). 
Overall, somatic eQTL analysis identified mostly non-coding regions 
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associated with changes in local gene expression and, similar to cancer- 
specific germline eQTLs, showed enrichment for transcriptionally 
inactive regions such as heterochromatin. 


Expression and mutational signatures 


Global variations in mutational patterns can be quantified using muta- 
tional signatures, which tag mutational processes specific to their 
tissue-of-origin and environmental exposures”. However, the extrac- 
tion of mutational signatures is an intrinsically statistical process that 
requires a posteriori functional annotation. We performed a pan-cancer 
association analysis between genome-wide mutational signatures 
and gene expression levels to decipher the molecular processes that 
accompany the presence of mutational signatures. 

We considered 28 mutational signatures derived using non-negative 
matrix factorization of context-specific mutation frequencies’. We 
tested for association between signature prevalence in donors and total 
gene expression, accounting for total mutational burden, cancer type, 
and other technical and biological confounders. This identified 1,176 
genes associated with at least one signature (FDR < 10%) (Extended 
Data Fig. 10, Supplementary Table 19). 

We considered 18 signatures with 20 or more associated genes for 
further annotation (Extended Data Fig. 11) and assessed enrichment 
using GO categories”° and Reactome pathways”. We found that 11 
signatures were enriched for at least one category (FDR < 10%) (Sup- 
plementary Table 19), revealing associations consistent with known 
and unknown aetiologies (Fig. 1d). For example, signature 38, which 
is correlated with the canonical UV signature 7 (r° = 0.375, P=5 10°) 
(Extended Data Fig. 11c), was linked to melanin processes (Fig. 1d). The 
synthesis of melanin causes oxidative stress to melanocytes”, and we 
found signature 38 associated with the oxidative-stress-promoting 
gene TYR” (P=1.0 x 10%). A hallmark of signature 38 genes are C>A 
mutations, atypical product of reactive oxygen species”. This suggests 
that signature 38 may capture DNA damage thatis indirectly caused by 
UV-induced oxidative damage after direct sun exposure”, with TYR as 
a possible mediator of the effect. 


Genomic basis of allelic expression 


To analyse expression at the level of individual haplotypes, we tested 
for allelic expression imbalance (AEI) (FDR < 5%, binomial test). We 
observed substantial differences in the fraction of genes with AEI 
between different types of cancer (Extended Data Fig. 12), and between 
cancer and the corresponding healthy tissues, with a high observed 
concordance between allelic imbalance at the DNA and RNA levels 
(Extended Data Fig. 13). 

We used a logistic regression model to identify the determinants 
of AEI, accounting for known imprinting status”, the germline eQTL 
genotype, SCNAs and the weighted mutational burden of proximal 
somatic SNVs stratified into functional categories (Extended Data 
Fig. 2). Inaggregate, SCNAs accounted for 84.3% of the total explained 
variation, which confirmed our findings from the somatic eQTL anal- 
ysis, followed by germline eQTL lead variants (9.1%), somatic SNVs 
(4.9%) and imprinting status (1.7%) (Extended Data Fig. 14). Although 
cumulatively, non-coding variants were more relevant than coding 
variants, somatic protein-truncating variants (‘stop-gained’ variants) 
that triggered nonsense-mediated decay” were the most predictive 
individually. SNVs within splice regions, 5’ untranslated regions (UTRs) 
and promoters were also strongly associated with the presence of AEI, 
and we observed a global trend of decreasing relevance of variants 
with increasing distance from the TSS (Fig. le, Extended Data Fig. 14). 

Gene-centric attribution of AEI to individual sources of genetic vari- 
ation (Supplementary Table 9) revealed an enrichment of somatically 
induced AElin several known cancer-driver genes, as well as new candi- 
dates, suchas the mismatch-repair-related gene EXO1 that is associated 


PCAWG-specific 
(Replicated in GTEx 


Germline eQTL 


= 
a 


ar 
Oo 


Mean fold change 
oa 


Oo 


Mean variance explained per gene (%) 


= 
{=} 
i=] 
oO 


2.0 


= 
Oo 
=I 


1.5 


[5 
No. of genes 
o5 


Adaptive immune response 

Lymphocyte proliferation 

Biological oxidations 

Regulation of histone methylation 

Male meiosis | 

DNA deamination 

Transposition 

Reg. of neurotransmitter receptor activity 
Reg. of glutamate receptor signalling 


1.01 5 


0.5 


Change in log—odds 


mas Bivalent promoter 
mm Flanking 
mas Intronic 


TEKTS 


-log, o(P) 
-~ Mh wo fF A DN CO OO 
° 


=] 


Keratinization 
Peptide cross-linking 
Response to vitamin D 


. 
t) 
' 
ot 
a] 
w 
be 
? 
het 
—= 
a] 
at 
’ 
a] 
» 


Developmental pigmentation 
Eukaryotic translation elongation 
Peptide chain elongation 

Platelet adhesion to exposed collagen 
CD209 signalling 

ABCA transporters in lipid homeostasis 
mRNA decay by exoribonuclease 
Negative reg. of cholesterol transport 
Negative reg. of sterol transport 


Imprinted 


log(no. ASE sites) 


log(read depth, sample) 


Fig. 1|Germline and somatic SNVs associated with expression. a, Epigenetics 
Roadmap enrichment analysis, showing the average fold change in Roadmap 
factors across cell lines in PCAWG-specific eQTLs of the pan-analysis as well as 
eQTLs that replicate in GTEx tissues. *P< 0.05/25, one-sided Wilcoxon rank- 
sum test in PCAWG-specific eQTLs corrected for the number of Roadmap 
factors used (that is, 25). Dataare mean and s.d. b, Variance component analysis 
for gene expression levels, showing the average proportion of variance 
explained by different germline and somatic factors for different sets of genes 
including the mean effect across all factors: (1) all genetic factors (germline and 
somatic); (2) SCNAs; (3) somatic variants in flanking regions; (4) population 
structure; (5) cis-germline effects; and (6) somatic intron and exon mutation 
effects. c, Manhattan plot showing nominal P values of association for TEKTS 
(highlighted in grey), considering flanking, intronic and exonic intervals. 


with survival in colorectal adenocarcinoma (log-rank P= 0.022, hazard 
ratio = 0.57) (Supplementary Results). We further observed a strong 
enrichment in the AEI score of cancer testis genes based on somatic 
SNVs only (x test P= 6 x 10°). Insummary, we identify somatic and 
germline genetic variation that is associated with allele-specific dys- 
regulation of genes across cancer types. 


Mutations associated with promoter usage 


We considered promoter activity”* ® as another molecular phenotype 
to study the effect of promoter mutations. Although cancer-specific 
alternative promoter usage has previously been shown”*, the associa- 
tion of underlying genomic alterations with promoter activities have 
not been broadly explored. To estimate the activity of individual gene 
promoters, we combined the expression of isoforms initiated in TSSs 
that are identical or nearby, assuming that these are transcribed from 
the same promoter (Extended Data Fig. 15a—c). We divided promoters 
into three categories: (1) inactive promoters (activity <1 fragment per 
kilobase of transcript per million mapped reads (FPKM)), (2) major pro- 
moters (most active per gene) and (3) minor (all remaining) promoters, 
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The leading somatic burden is associated with increased TEKTS expression 
(P=1.61x 10°) and overlaps an upstream bivalent promoter (red dots; 
annotated in 81 Roadmap cell lines, including 8 embryonic stem cells, 9 
embryonic-stem-cell-derived and 5 induced pluripotent stem-cell lines). 

d, Summary of significant associations between mutational signatures (Sig) 
and gene expression. Top, the total number of associated genes per signature 
(FDR<10%). Bottom, enriched GO categories or Reactome pathways for genes 
associated with each signature (FDR<10%, significance level encodedin 
colour, -log,)-transformed adjusted P value). e, Standardized effect sizes on 
the presence of AEI, taking only SCNAs, germline eQTLs, coding and non- 
coding mutations into account. Data are the estimate and standard error of the 
estimate of the effect size. 


and examined the rates of mutation across varying activity levels. We 
observed an increase in the number of mutations near the TSS of major 
promoters compared with minor or inactive promoters (Extended 
Data Fig. 15d). This pattern is most prominent in skin melanoma, in 
which it has been attributed to impaired nucleotide-excision repair 
(Extended Data Fig. 15e, f, k, 1). The cancer type that shows the strong- 
est deviation from this pattern is colorectal adenocarcinoma, which 
highlights the tissue-specificity of mutational patterns at promoters 
(Extended Data Fig. 15e, f, m, n). Only 171 promoters show mutations 
inmore than 5 samples per tumour type ina 200-bp window upstream 
of the promoter (Extended Data Fig. 15g, h). Most mutations occur in 
skin melanoma and lymphoma, which is expected owing to reduced 
nucleotide-excision repair and activation-induced cytidine deaminase 
(Extended Data Fig. 15h). We did not find significant pan-cancer asso- 
ciations between promoter mutational burden and promoter activity 
(Extended Data Fig. 15i, j). However, TERT has the highest number of 
promoter mutations’ (Extended Data Fig. 16a), and these mutations 
have previously been reported to be associated with TERT expression’; 
therefore, we investigated the TERT locus in more detail (Extended Data 
Fig. 16b). Although TERT does not show a significant association inthe 
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Fig. 2| Position-specific effect of somatic mutations on alternative splicing. 
a, Top, proportion of mutations near exon-intron junctions and at branchsites 
that are associated with exon-skipping events. Mutations with associated 
splicing changes are those in which the percentage spliced in-derived |z-score| 
is >3 (dark blue). Asterisks denote intron positions significantly enriched for 
splicing changes relative to background based ona permutation test. *P<0.05, 
**P<0.01,***P< 0.001. Bottom, sequence motifs of regions. b, Example of an 


pan-cancer analysis, we found an association with increased promoter 
activity in individual types of cancer’ (Extended Data Fig. 16c). 


Mutations associated with splicing 


Extending the classical hallmarks of cancer, alternative splicing is 
seen as increasingly relevant to explain cancer heterogeneity”. On 
the basis of our observations of a globally changing splicing landscape 
(Extended Data Fig. 17a-c), we sought to specifically understand the 
relationship between splicing changes and somatic mutations within 
introns. Focusing on cassette exon events, we integrated the quan- 
tification of splice events with somatic variants and identified 5,282 
mutations near exon-intron boundaries, 1,800 (34%) of which were 
associated with a change in splicing (|z-score| = 3) (Supplementary 
Table 10). Consistent with previous findings using exome sequenc- 
ing®**, most mutations overlapping the essential dinucleotide motifs 
of the acceptor or donor site are associated with a splicing change—61% 
or 57%, respectively (Fig. 2a). Nearly one-third of all mutations (226 
out of 469) in a 5-nucleotide window downstream of the 5’ site were 
significantly enriched for splicing changes (Fig. 2a). Almost all changes 
significantly associated with somatic mutations had a negative effect 
on splicing (96%) (Extended Data Fig. 17d). For mutations in or near 
the poly-pyrimidine tract, we found a significant (permutation test, 
P<0.05) enrichment for mutations linked to outlier splicing (Fig. 2a). 
We also found an enrichment (P< 0.05, fold change > 2) of splicing outli- 
ers at branch-site adenosines (Fig. 2a middle, Extended Data Fig. 17d, 
Supplementary Table 11). Together, these results suggest that somatic 
mutations inthe extended splice site region, poly-pyrimidine tract and 
branch point can affect splicing. 

We also identified 1,900 rare splicing-associated variants (SAVs) that 
appear in only asmall number of samples using the SAVNet approach*® 
(Extended Data Fig. 17e; see ‘Data availability’ in the Methods). Notably, 
862 SAVs affected canonical splice sites, whereas the other 1,038 dis- 
rupted non-canonical sites or created new splice sites. Notably, we find 
atwofold enrichment of cancer genes in SAVs (Extended Data Fig. 17f). 

Although we find that those SAVs that create splice sites strongly con- 
centrate near exon-intron boundaries (Extended Data Fig. 17g), 45.9% 
of SAVs are further than 100 bp away from the nearest annotated exon. 
Mutations at those sites generally changed the sequences towards the 
donor or acceptor motif consensus (Extended Data Fig. 17h). Focusing 
onnovel splice sites deep inintrons, we analysed the extent of exoniza- 
tions—that is, the formation of new exons within an intron (Extended 
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exonization event in the tumour-suppressor gene STK11. The RNA-seq read 
coverage fora part of the gene is shown in red for adonor carrying the 
alternative allele, andin grey forarandom donor with reference allele. The 
cassette exon event is shownas a schematic below. c, Enrichment of SINE 
elements in SAVs compared to sequence background (BG). Shown for SINE 
elements overlapping in sense (middle) and antisense (right) directions. 


Data Fig. 17j, Supplementary Tables 13, 14). More than one-fifth of these 
new exons (9 out of 43) occur in cancer-related genes, suchas the well- 
known tumour-suppressor gene STK11. As expected, the exonization 
event would cause a frameshift in S7K11 (Fig. 2b, Extended Data Fig. 17k). 

Aluelements that are inserted in an antisense direction have sequences 
that resemble consensus splice sites that, together with activating muta- 
tions, canlead to the formation ofa new exon” (Extended Data Fig. 171). 
We found a significant enrichment of splice-site-creating SAVs within 
annotated Alu sequences (P= 2.8 x 10°’), particularly in the antisense 
direction (P=2.6 x10~) (Fig. 2c). Our results indicate that the exonization 
of Alu sequences, which has been extensively studied in the context of 
primate genome evolution, is also observed in cancer genome evolution. 


Patterns of gene fusions across cancer 


Gene fusions are an important class of cancer-driving event with thera- 
peutic and diagnostic value”. We identified a total of 925 known and 
2,372 new cancer-specific gene fusions by combining the output of two 
fusion discovery methods as well as genomic rearrangement (structural 
variants) information and excluding artefacts or fusions innon-cancer 
samples*s (Fig. 3a). For the 3,540 identified fusion events representing 
3,297 unique gene fusions, we categorized them on the basis of novelty, 
recurrence and known oncogenic gene partners (Fig. 3a). 

Only 149 (approximately 5%) of the fusions occur in more than one 
sample, among which 78 are novel. Most of these (46 out of 78) were 
found across several histotypes. Of the 27 most recurrent gene fusions 
(Extended Data Fig. 18a), 8 have previously been reported (for example, 
CCDC6-RET*’, FGFR3-TACC3” and PTPRK-RSPO3) or independently 
detected in the TCGA cohort*!, whereas 6 were new (such as NUMB- 
HEATR4, ESR1-AKAP12 and TRAF3IP2-FYN). In total, 105 fusion tran- 
scripts involved the UTR region of one gene and the complete coding 
sequences of another gene, possibly resulting from structural variation 
in promoter regions. 

Although most genes involved in fusions engaged with only one 
fusion partner, 35 genes had more than S partners. These ‘promiscu- 
ous’ genes tended to be selective in being either a 5’ or a3’ partner 
with conserved break points and positions (3’ or 5’), and were over- 
represented in cancer census genes and the PCAWG cancer-driver 
genes (one-tailed Fisher's exact test, odds ratio = 8.66, P< 1.110, 
and odds ratio = 12.27, P< 2.2 x 10°, respectively). Network analysis 
of promiscuous genes and their partners revealed several large gene 
clusters containing at least 10 genes (Extended Data Fig. 18b), enriched 
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Fig. 3 | Structural rearrangements associated with RNA fusions. a, The 
number of all detected and new fusions and their overlap with the cancer 
census genes. b, Schematic of an example of bridged fusions. Bridged fusions 


in cancer-related pathways (Benjamini-Hochberg corrected P< 0.01) 
and in protein-protein interactions (P< 1.0 x 10”), which suggests a 
possible functional role in cancer. 

Notably, alarge number of fusions, including known fusions, could not 
be associated with only a single structural-variation event. For example, 
the ETV6-NTRK3 gene fusion” was present in a head and neck thyroid 
carcinoma sample, linking exon 4 of ETV6 to exon 12 of NTRK3. We found 
three separate structural variants in the same sample: (1) atranslocation 
of ETV6to chromosome 6; (2) atranslocation of N7RK3also to chromo- 
some 6; and (3) an additional copy-number loss spanning from intron 
5 of ETV6 to the exact structural variant break points, jointly bringing 
ETV6within 45 kb upstream of NTRK3—a distance that would allowtran- 
scriptional read-through” or splicing” to yield the ETV6-NTRK3 fusion® 
(Fig. 3b). Thus, the short chromosome-6 segment appeared to func- 
tionas abridge, which linked two genomic locations to facilitate a gene 
fusion. We term such products ‘bridged fusions’. This class of fusion is not 
uncommon. Out ofa total of 436 gene fusions supported by 2 separate 
structural variants, 75 are bridged fusions (Supplementary Table 15). 

On the basis of the nature of the underlying genomic rearrange- 
ments, we propose a unified fusion classification system (Extended 
Data Fig. 19a). Aside from bridged fusions, 344 additional fusions are 
linked to more than one structural variant in the same sample. These 
multi- structural variant fusions are collectively termed ‘composite 
fusions’ (Extended Data Fig. 19a, b). We find 284 intercomposite fusions 
(interchromosomal translocation) and 124 intracomposite fusions 
(intrachromosomal rearrangement), exemplified by FRCI-RET1 and 
NUMB-HEATR4 fusions, respectively (Extended Data Fig. 19b). Com- 
posite rearrangements bring the fusion partners significantly closer to 
each other, from the median natural distance of 6.8 Mb to the median 
of 7.9 kb (Wilcoxon rank-sum test, P< 2.2 x 10“; Extended Data Fig. 19c) 
after translocation. For 18% of fusions, no evidence of structural varia- 
tion was found. Given that 340 structural-variant-independent, intra- 
chromosomal fusions had significantly closer break points than those 
with structural variation (Extended Data Fig. 19d), it is possible that they 
could result from RNA read-through events. The other possibility is that 
the underlying supporting structural variants escaped detection, as 
shown by the observation that known gene fusions that are driven by 
structural variation, such as TMPRSS2-ERG**, did not have consistent 
evidence for structural variation in matching samples. 


Landscape of RNA alterations in cancer 


Given our comprehensive set of RNA alterations, we sought to char- 
acterize the heterogeneous mechanisms of cancer genome and 
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transcriptome alterations. To enable joint analyses of RNA and DNA 
alterations, we created a gene-level table, which indicates the presence 
or absence of possible functional changes to RNA or DNA for each gene 
and donor. After stringent filtering, we identified 1,523,098 alteration 
events, in whichan event is a gene-sample-alteration triplet (Extended 
Data Table 1, Supplementary Table 14). It should be noted that we chose 
to include only RNA alterations with potential functional effects or 
with the strongest quantitative affect, resembling similar strategies for 
filtering DNA alterations’. Recurrence analysis across several alteration 
types helped us to further enrich for functionally relevant genes. Build- 
ing on the gene-centric table, we characterized gene alterations at the 
RNA level and contrasted these with DNA alterations (non-synonymous 
SNVs or SCNAs)°*. On the basis of the calculated association between 
each RNA- and DNA-level alteration across all histotypes, we found that 
half of the RNA alterations significantly correlated with DNA alterations 
(likelihood ratio test, FDR <1 10) (Extended Data Fig. 20). 

When comparing gene alteration frequencies across all histotypes 
(Fig. 4a), we note that different types of cancer contain distinct com- 
binations of DNA- and RNA-level alterations (Fig. 4a, Supplementary 
Table 17). Although, as expected, skin melanoma significantly exceeds 
other cancers in the number of non-synonymous SNVs*8 (Wilcoxon 
rank-sum test, P< 0.012), lymphatic cancers have low numbers of 
SNVs (Wilcoxon rank-sum test, P=5.3 x 10), but high incidences of 
alternative splicing outliers (Wilcoxon rank-sum test, P=4.9 x 107%”), 
which suggests that transcriptomic alterations can be relatively more 
pronounced in certain cancer types. 

To evaluate to which extent RNA changes provide additional mecha- 
nisms for cancer gene alterations, we examined DNA- and RNA-level 
alterations both in sets of genes in pathways (Extended Data Fig. 21) 
and in individual genes with known roles in cancer (Extended Data 
Fig. 22). We found that RNA alterations occur at a high proportion in 
many pathways, including the NOTCH and TGF-B pathways. In addition, 
KRAS exhibits more RNA alterations than DNA alterations insome types 
of cancer. Given the recent finding that alternative splicing of KRAS 
expanded the prognostic affect beyond mutation status in colorectal 
cancer”, our data further support several modes of alteration for KRAS 
intumours. 


Co-occurrence of RNA and DNAalterations 

The diverse types of alteration in this study enabled us to investigate 
trans-associations between different genetic and expression charac- 
teristics involving cancer-related genes (FDR < 5%) (Supplementary 
Table 18). By investigating whether somatic mutations of known cancer 
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Fig. 4| Global view of DNA and RNA alterations that affect tumours. a, The 
median numbers of different alterations across histotypes. Histotypes are 
ordered by hierarchical clustering based on the pattern of different types of 
alteration. Only histotypes with more than 10 donors are shown. Alt., alternative; 
non-syn, non-synonymous. Cancer-type abbreviations are listed in 
Supplementary Table 23. b, c, Circular representations of the selected genes 
significantly co-occurred with B2M (b) and PCBP2(c). Connecting lines indicate 
the specific types of co-occurrence of alteration pairs. The inner histograms 


genes are associated with the expression of other genes, we found /DH1 
and NFKBIE to be widely linked to the dysregulation of many genes 
(Extended Data Fig. 23a, b). Notable co-occurrences were present in 
several types of cancer. For example, B2M and ElF4G2 alterations were 
simultaneously observed in both B-cell non-Hodgkin lymphoma and 
lung squamous cell carcinoma. Pathway enrichment analysis of the top 
100 genes associated with all B2M alterations indicates that the most 
affected genes are involved in DNA repair (FDR<1%), and approximately 
two-thirds of those associations were significant in more than one 
cancer type (Fig. 4b, Extended Data Fig. 23c). 

We also examined how cancer genes could be affected by other genes 
by co-occurrence analyses. Expression outliers of PCBP2co-occurred 
with aberrant splicing ofa large number of cancer-related genes, includ- 
ing CTNNB1 and CDK¢4 (Fig. 4c). PCBP2 has been reported to enhance 
the splicing of cassette exons*°. Our results thus further support the 
possible role of PCBP2in regulating the splicing of cancer-related genes. 


Recurrent RNA alterations in driver genes 

In our analyses of cis-acting mutations that are associated with these 
individual RNA phenotypes, the vast majority were observed rarely in 
the PCAWG cohort. Many cancer genes (such as MET>”) are known to 
be somatically altered by heterogeneous mechanisms such as gene 
fusions, splicing mutations and non-synonymous mutations; therefore, 
examining genes that are altered by several cis-acting mechanisms 
may help to identify cancer genes in which an individual alteration 
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indicate the frequencies of incidences of different alteration types shownin 
different colours. d, All 74 Catalogue of Somatic Mutations in Cancer (COSMIC) 
cancer census genes or PCAWG driver genes that are both frequently and 
heterogeneously altered across both RNA- and DNA-level alterations. Yellow bars 
indicate the proportion of samples that had DNA-level alterations, and green 
bars indicate the proportion of samples with RNA-level alterations. Middle 
columnis the proportion of each alteration type observed for that gene. e, The 
enrichment of cancer genes within our list of significantly recurrent genes. 


type is rare. A total of 5,413 genes were altered by gene expression, 
allele-specific expression (ASE), splicing and/or gene fusion, and had 
an associated DNA-level mutation in cis (Supplementary Table 20). 
PCAWG-defined driver genes® tended to have more diverse mecha- 
nisms of RNA-level alterations when compared to genes that have 
not previously been identified as a cancer gene (P< 0.001) (Extended 
Data Fig. 24a). We identified, for example, a somatic eQTL, a splicing- 
associated variant and fusions in the known tumour-suppressor NF1 
in the MAPK pathway (Extended Data Fig. 24b). 

Owing to the fact that most somatic mutations are rare’, it is diffi- 
cult to statistically distinguish functionally relevant, potential driver 
alterations from passenger alterations. Therefore, we aimed to identify 
genes that are both recurrently and heterogeneously altered, under 
the hypothesis that these genes have increased functional relevance. 
This analysis identified 731 genes with significant recurrent aberra- 
tions (FDR < 5%) (Extended Data Fig. 25a), with the top-ranking genes 
carrying both RNA and DNA alterations. RNA alterations accounted 
for 0.05-99.14% (mean: 78.23%) of all identified alterations in each 
gene (Extended Data Fig. 25a, Supplementary Table 21). This rank- 
ing is enriched for the union of cancer census genes~ (60 out of 603) 
and PCAWG- defined driver genes (33 out of 157, unioned: 74 out of 674 
P=4,6 x10, enrichment: 2.45) (Fig. 4d, e). 

Among the top 10% of our ranked genes is CDK12 (rank 55). We find 
91samples that have an alteration involving its protein kinase domain, 
which has been implicated in DNA repair dysregulation™. Many of these 
samples have no DNA-level alterations in CDK12 (46%) (Extended Data 


Fig. 26a). Furthermore, splicing, alternative promoter, SNV, RNA-edit- 
ing and fusion alterations in this gene are mutually exclusive (adjusted 
P=4.8 x 10°) (Extended Data Fig. 26b, c). Upon further investigation, we 
find that somatic eQTL mutations in CDK12are associated withatandem 
duplicator phenotype®. Although this association was not replicated 
with other RNA alterations, it provides evidence that somatic CDK12 
mutations may alter its function through gene expression changes. 
This example illustrates that performing a recurrence analysis over 
diverse RNA and DNA alterations can help to identify genes known to 
be important in tumorigenesis. 


Discussion 


Here we present a comprehensive catalogue of RNA-level alterations in 
cancer, spanning 27 different tumour types, and provide a harmonized 
resource of matched transcriptome and whole-genome sequences. We 
identified 731 genes that were recurrently altered by several mecha- 
nisms, jointly enriched for known cancer census and PCAWG driver 
genes®, The list includes genes that are primarily altered at the DNA level 
(suchas 7P53), but also genes for which the alteration most frequently 
manifests in RNA (such as GAS7). Out of 87 samples from the PCAWG 
study that did not have a driver alteration at the DNA level>, and had 
RNA-sequencing (RNA-seq) data, every sample had an RNA-level altera- 
tion identified. Although cancer is thought to be driven by changes in 
DNA primarily, some driver alterations may manifest themselves via 
changes in RNA rather than DNA sequence mutations. 

We identified germline eQTLs for around 20% of expressed genes. 
The number of eGenes found is generally low compared with some 
other studies, reflecting the heterogeneity of our samples. Only 422 
genes appeared to be specific to cancer; this is likely to be an under- 
estimate owing to the heterogeneity, small sample numbers and the 
rather conservative strategy chosen. We have also mapped linkages 
between genes and somatic aberrations in cis, in which 68.4% of asso- 
ciations were between non-coding somatic variants and gene expres- 
sion. Allelic copy-number imbalance is a major determinant of ASE 
dysregulation in cancer. We found mutations associated with splicing 
changes including novel cancer-specific exons that can be partially 
explained by mutation-driven exonization. We systematically com- 
pared gene fusions with whole-genome rearrangements across many 
tumour types and found 82% of detected fusions were associated with 
specific genomic rearrangements. For the remaining fusions, it is pos- 
sible that the relevant genomic rearrangements have not been detected, 
or that some fusions happen directly at the RNA level, as trans-splicing 
or read-through events. The availability of whole-genome sequences 
allowed us to develop a systematic classification of fusion events and 
to propose a new bridged fusion mechanism. 

Because global differences in RNA expression phenotypes are largely 
tissue-specific, our ability to associate mutations in cis or trans are 
limited by the small and variable sample sizes within each histotype. 
Further work is needed to investigate other mechanisms of genome 
alteration that can lead to changes in RNA suchas epigenetic changes” 
or enhancer hijacking*”. Our work will help to prioritize further inves- 
tigations. 

Overall, our analyses show diverse modes of alteration of cancer 
genes and pathways at the DNA and RNA levels, and demonstrate 
that RNA analyses reveal cancer-associated pathway alterations that 
have not yet been detected via DNA-only approaches. These insights 
illustrate the power of integrated transcriptome and whole-genome 
sequencing analysis for cancer studies. 
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Methods 


RNA-seq alignment and quality-control analysis 

Tumour and healthy ICGC RNA-seq data, included in the PCAWG 
cohort?, was aligned to the human reference genome (GRCh37.p13) 
using two read aligners: STAR® (v.2.4.0i, two-pass), performed at 
MSKCC and ETH Zirich, and TopHat2” (v.2.0.12), performed at the 
European Bioinformatics Institute. Both tools used Gencode (release 
19) as the reference gene annotation. For the STAR two-pass align- 
ment, an initial alignment run was performed on each sample to gen- 
erate a list of splice junctions derived from the RNA-seq data. These 
junctions were then used to build an augmented index of the reference 
genome per sample. Ina second pass, the augmented index was used 
for amore sensitive alignment. Alignment parameters have been fixed 
to the values reported in https://github.com/ICGC-TCGA-PanCancer/ 
pcawg3-rnaseq-align-star. The TopHat2 alignment strategies also 
followed the two-pass alignment principle, but was performed in 
a single alignment step with the respective parameter set. For the 
TopHat2 alignments, the irap analysis suite® was used. The full set 
of parameters is available along with the alignment code in https:// 
hub.docker.com/r/nunofonseca/irap_pcawg/. For both aligners, the 
resulting files in BAM format were sorted by alignment position, 
indexed and are available for download in the GDC portal (https:// 
portal.gdc.cancer.gov/) and the ICGC Data Portal (https://dcc.icgc. 
org/). The individual accession numbers and download links can be 
found in the PCAWG data release table: http://pancancer.info/data_ 
releases/may2016/release_may2016.v1.4.tsv. Cancer-type abbrevia- 
tions are listed in Supplementary Table 23. Histology was derived 
from an older version released by the PCAWG Pathology and Clini- 
cal Correlates Working Group. Assignments of donor to histology 
used in this study can be found in the file rnaseq.extended.metadata. 
aliquot_id.V4.tsv.gz at https://dcc.icgc.org/releases/PCAWG/ 
transcriptome/metadata/. 

Quality control of all datasets was performed at three main levels: 
(1) assessment of initial raw data using FastQC® (v.0.11.3) (Supplemen- 
tary Fig. 4); (2) assessment of aligned data (percentage of mapped and 
unmapped reads for both alignment approaches); and (3) quantifica- 
tion (by correlating the expression values produced by the STAR and 
TopHat2 based expression pipelines) (Supplementary Fig. 2). In total, 
we defined six quality-control criteria to assess the quality of the sam- 
ples. We marked a sample as a candidate for exclusion if: (1) 3 out of 5 
main FastQC measures (base-wise quality, k-mer overrepresentation, 
guanine-cytosine content, content of N bases and sequence quality) 
did not pass; (2) more than 50% of reads were unmapped or fewer 
than1 million reads could be mapped in total using the STAR pipeline; 
(3) more than 50% of reads were unmapped or fewer than 1 million reads 
could be mapped in total using the TopHat2 pipeline; (4) we measured 
a degradation score® greater than 10; (5) the fragment count in the 
aligned sample (averaged over STAR and TopHat2) was <5 million; and 
(6) the correlation between the expression counts of both pipelines was 
<0.95. If a sample did not pass one of these six criteria it was marked 
as problematic and placed ona greylist. If more than two criteria were 
not passed, we excluded the sample. 

Asubset of 722 libraries from the projects ESAD-UK, OV-AU, PACA-AU 
and STAD-US were identified as technical replicates generated from 
the same sample aliquot. These libraries were integrated post- 
alignment for both the STAR and the TopHat2 pipelines using sam- 
tools into combined alignment files. Further analysis was based on 
these files. Read counts of the individual libraries were integrated 
to asample-level count by adding the read counts of the technical 
replicates. 

Initially, a total of 2,217 RNA-seq libraries were fully processed by 
the pipeline. Quality-control filtering and integration of technical 
replicates (722 libraries) gave a final number of 1,359 fully processed 
RNA-seq sample aliquots from 1,188 donors. 


GTEx data analysis 

For a panel of RNA-seq data froma variety of healthy tissues, data from 
3,274 samples from GTEx (phs000424.v4.p1) were used and analysed 
with the same pipeline as PCAWG data for quantifying gene expression. 
Alist of GTEx identifiers are provided at https://dcc.icgc.org/releases/ 
PCAWG/transcriptome/metadata. 


Quantification and normalization of transcript and gene 
expression 
STAR and TopHat2 alignments were used as input for HTSeq® (v.0.6.1p1) 
to produce gene expression counts. Gencode v.19® was used as the 
gene annotation reference. Quantification ona per-transcript level was 
performed with Kallisto® (v.0.42.1). This implementation is available 
as a Docker container at https://hub.docker.com/r/nunofonseca/irap_ 
pcawg. The implementation of the STAR and TopHat2 quantification 
is available as docker containers in: https://github.com/ICGC-TCGA- 
PanCancer/pcawg3-rnaseq-align-star and https://hub.docker.com/r/ 
nunofonseca/irap_pcawg/, respectively. Quantification of consensus 
expression was performed by taking the average expression based 
on STAR and TopHat2 alignments. Gene counts were normalized by 
adjusting the counts to FPKM” as well as FPKM with upper quartile 
normalization (FPKM-UQ) in which the total read counts in the FPKM 
definition has been replaced by the upper quartile of the read count 
distribution multiplied by the total number of protein-coding genes. 
The FPKM and FPKM-UQ calculations were as follows. 
FPKM =(C~x 10’)/(NL), in which N denotes the total fragment count to 
protein-coding genes, L denotes the length of the gene and C denotes 
the fragment count. FPKM-UQ = (C x 10°)/(ULG), in which U denotes 
the upper quartile of fragment counts to protein-coding genes on auto- 
somes unequal to zero, and G denotes the number of protein-coding 
genes on autosomes. 


t-Distributed stochastic neighbour embedding analysis 

The t-distributed stochastic neighbour embedding (t-SNE) plots 
in Supplementary Figs. 5 and 6 were produced using the RTsne 
package® (with a perplexity value of 3) based on the Pearson correla- 
tion of the aggregated expression (log + 1) of the 1,500 most variable 
genes. FPKM expression values per gene were aggregated (median) 
by tissue (GTEx) and study (PCAWG). Coefficient of variation for each 
gene was also computed per tissue (GTEx) and study (PCAWG) to deter- 
mine the 1,500 most variable genes. Purity values were previously 
described®. 

Thet-SNE plot in Extended Data Fig. 17c is based on all exon-skipping 
events in protein-coding genes confirmed by SplAdder”. Each event 
was quantified in both the PCAWG and GTEx cohort. All events with 
more than 1% of missing percentage spliced in (PSI) values across the 
concatenated PCAWG and GTEx samples were removed. The remaining 
missing values were imputed as the mean over the non-missing samples. 
The centred data were then visualized using the TSNE package from 
the Scikit Learn toolkit” with a perplexity value of 100, random state 
O and aninitialization with PCA. 


Associations between genetic variation and gene expression: 
patient cohort 

To associate genetic variation with gene expression, we analysed 
whole-genome sequencing (WGS) of the 1,188 donors with matched 
whitelisted RNA-seq data from the PCAWG cohort. Germline genotypes, 
SNV calls and segmented allele-specific SCNA calls were previously 
reported>. We matched 1,188 tumour RNA-seq IDs° to WGS whitelist 
tumour IDs (synapse entry syn10389164). For patients with multiple 
WGS IDs (2 out of 1,188) or RNA-seq aliquot IDs (17 out of 1,188), we 
resolved the matching by pairing samples with the same ‘tumor_wgs_ 
submitter_specimen_id’ (Supplementary Table 1). The 1,188 patients 
are spread across 27 types of cancer and 29 project codes and include 
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899 carcinomas; 34 patients are metastatic and 13 recurrent with the 
remaining patients being primary tumours (Supplementary Table 1). 

We used the data of these 1,188 patients for performing somatic and 
germline eQTL mapping, ASE analysis and association studies between 
gene expression and mutational signatures. 


Gene expression filtering 
Gene expression values (measured in FPKM; https://dcc.icgc.org/ 
releases/PCAWG/transcriptome/gene_expression) from consensus 
expression quantification as described above were used for this analysis. 
Genes with FPKM = 0.1in at least 1% of the patients (12 patients) were 
retained, resulting in 47,730 genes. Only 18,898 protein-coding genes 
(according to the ‘gene type’ biotype reported in Gencode v.19®) were 
used for the subsequent QTL analyses. The log,-transformed expres- 
sion values (FPKM + 1) were subjected to peer analysis” to account for 
hidden covariates (syn7850427; https://dcc.icgc.org/releases/PCAWG/ 
transcriptome/eQTL/phenotype). To balance the number of covariates, 
statistical power and available sample sizes per cancer type, we followed 
the GTEx protocol and estimated 15, 30 and 35 hidden covariates to be 
used depending on sample size” (n < 150, 150 <n< 250, n> 250). Peer 
residuals were then rank-standardized across patients. The FPKM cut-off 
values and peer correction were also applied to the subset of 899 patients 
with carcinoma, yielding 18,837 protein-coding genes after filtering. 
Furthermore, we used ordinary least-squares regression to correlate 
each of the 35 peer factors with per-sample covariates, including can- 
cer project codes, gender, tumour purity, somatic burden and several 
sequence metrics (Supplementary Notes), to understand the proportion 
of variance explained by known biological and technical covariates. 


Covariates 

In all linear models, we accounted for known confounding factors by 
modelling them as fixed effects. In all association studies, we accounted 
for sex, project code (describing cancer type and country of origin) 
and per-gene copy-number status (Supplementary Table 1 for the 
list of per patient covariates; syn7253568 and syn7253569 for sex and 
project codes; syn9661460 for per gene copy number). Per-gene copy- 
number alterations were derived as the average copy number across all 
copy-number aberrations called within the annotated gene boundaries 
based on syn8042988. 

The somatic eQTL, ASE and mutational signature analyses also 
accounted for total somatic mutation burden (number of SNVs and 
short insertions and deletions (indels)) and sample purity (Supplemen- 
tary Table 1). Purity was estimated based on copy-number segmenta- 
tion. In addition, the somatic eQTL and ASE analyses accounted for local 
SNV burden calculated in a 1-Mb window from the gene coordinates 
(https://dcc.icgc.org/api/v1/download?fn=/PCAWG/transcriptome/ 
eQTL/covariates/pergene.somatic.snv.cis.burden.1188.wl.donors.tsv.gz). 

The germline eQTL analysis also modelled the population structure 
as random effect. The population structure was assessed by a kinship 
matrix that was calculated based on every twentieth germline vari- 
ant, processed as described below (see ‘Germline eQTL variants’). The 
kinship matrix was then calculated as an empirical patient-by-patient 
covariance matrix. 

Different covariates were accounted for per-analysis method (Sup- 
plementary Table 1). The project code describes cancer type and 
country-of-origin. Somatic burden is the total number of SNVs and 
indels. Purity was estimated based on copy-number segmentation. 
Local somatic burden is the number of SNVs in a1-Mb window around 
the gene coordinates. Local copy number was defined as the average 
copy-number state across all SCNAs called within the annotated gene 
boundaries. 


GO and Reactome pathway enrichment 


We performed GO”” and Reactome pathway”*” enrichment with the 
Bioconductor packages biomaRt””, clusterProfiler’and ReactomePA” 
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(FDR <10%). The number of genes used as background set is described 
per analysis method. 


Germline eQTL variants 

PCAWG variant calls v.0.1° were downloaded from GNOS and processed 
following the PCAWG-8 protocol: (1) VCF files were indexed and merged 
using bcftools®. (2) All variants were filtered for ‘PASS’ flag. (3) All vari- 
ants were filtered for quality larger than 20. (4) Only bi-allelic sites 
were considered. 

HDFS files for each 100-kb chunk of the VCF files were generated, 
assuming additivity that was numerically encoded as O, 1 or 2 for 
homozygous reference, heterozygous or homozygous alternative state, 
respectively. For indels, we encoded the presence or absence of the vari- 
ant as O or 1, respectively. Each variant was normalized to mean O and 
standard deviation 1. Missing variants were mean-imputed. To create 
our eQTL release set v.1.0, the resulting HDF5 files were subsequently 
merged into a global HDF5S file and all variants which follow any of the 
following conditions were removed: (1) minor allele frequency < 1%; 
and (2) missing values > 5% 


Germline eQTL analysis 

In the germline eQTL analyses, we used the processed gene expres- 
sion dataset from 1,178 patients for which germline variant calls (eQTL 
release set v.1.0, see ‘Germline eQTL variants’) were available. Linear 
mixed models were used to model the correlation between germline 
variants (within 100 kb of gene boundaries) and gene expression val- 
ues (see ‘Gene expression filtering’) using the limix package®. Known 
covariates were modelled as fixed effects and population structure as 
random effect (see ‘Covariates’). 

Atwo-step approach was used to adjust for multiple testing. First, for 
each gene, we adjusted for the number of independent tests estimated 
based on local linkage disequilibrium®. Second, we performed a global 
correction across the lead variants, that is, the most significant SNPs, 
per eQTL. Germline eGenes were defined as genes with an eQTL with 
global FDR <5%. 


GTEx comparative analysis 

The GTEx comparative eQTL analysis was based on the eQTL maps 
v.6p’°. We mapped the positions and alleles of our PCAWG-specific 
eQTL to the eQTL in all GTEx tissues. To determine whether a lead 
eQTL variant is replicated in a given GTEx tissue, we followed the previ- 
ously described strategy’®. For each eGene, we considered the eQTL 
lead variant and assessed the replicability of the signal in the GTEx 
cohort based on marginal association statistics using 42 GTEx tissues 
without cell lines (P< 0.00024 = 0.01/42, corrected for the number 
of GTEx tissues—that is, 42)). If the lead variant did not replicate or 
was not tested, we determined replication based on the variant with 
the smallest P value within the linkage disequilibrium block (r’ > 0.8 
estimated based on UK10K project) of the lead variant across 25 (or 42) 
tissue-matched GTEx analyses. If neither lead nor any variant within 
the linkage disequilibrium block was tested, we determined replica- 
tion based on the smallest P value of any variant within the 100-kb 
window tested within the GTEx cohort. We also derived less stringent 
sets of PCAWG-specific eGenes by allowing replication in up to1,5 or 
10 GTEx tissues. 


Tissue sharing of germline eGenes between histotypes 

Using the R package qvalue (https://github.com/StoreyLab/qvalue, 
v.2.14.0), we generated 7, statistics comparing the lead variants of one 
histotype against their P value distribution in the other histotypes. 
Because 7, Statistics are known to be confounded by sample size and 
number of eQTL found, we subsampled the eQTL lead variants toa 
randomly selected set of 100 variants. After 20 rounds of subsampling, 
we derived the same 7, statistics as mentioned earlier and reported 
the average. 


Roadmap enrichment of germline eGenes 

For each lead variant, we generated a matching background set of 
1,000 variants using SNPsnap®. Each variant (background and fore- 
ground) was intersected with the location of 25 Roadmap factors” in 
127 celltypes. From this we derived fold change and Pvalues. Significant 
changes of fold change between PCAWG-specific and unspecific eQTLs 
is based ona one-sided Wilcoxon rank-sum test. 


Enrichment analysis 
Enrichment of Reactome pathways of PCAWG-specific eGenes was 
performed using the Bioconductor package ReactomePA”. 


Somatic calls and mutational burden 

We used the set of consensus SNVs somatic calls provided by PCAWG 
(syn7357330) based on three core caller pipelines and MuSE**. On 
average, we counted 22,144 somatic SNVs per patient, with different 
median numbers of SNVs per cancer type, ranging from 1,139 in thy- 
roid adenocarcinoma to 72,804 SNVs in skin melanoma (Extended 
Data Fig. 5a). Owing to the low frequency of somatic SNVs across the 
cohort (Extended Data Fig. 5b), we collapsed the variants by genomic 
regions defined by gene annotations (Gencode v.19). Specifically, we 
generated aset of disjoint gene exons by collapsing overlapping exon 
annotations into single features using bedtools®. The set of disjoint 
introns was generated using bedtools by subtracting the collapsed 
exonic regions from the gene regions. To map local effects of somatic 
mutations in flanking features outside the gene body, we binned the 
surrounding regions (plus and minus 1 Mb from the gene boundaries) 
into 2-kb windows (flanking) overlapping by 1 kb. 

We defined three different types of aggregated somatic burden to 
assess differences in power in detecting somatic eGenes and P value cali- 
bration. The burdenina genomic region was defined as (1) abinary value 
that indicates presence or absence of SNVs; (2) the aggregated burden 
as sum of SNVs; or as (3) weighted burden, thatis, sum of variant allele 
frequencies of the SNVs (Supplementary Fig. 10a) to take into account 
their clonality (https://dcc.icgc.org/releases/PCAWG/transcriptome/ 
eQTL/genotypes). We assessed calibration of all three analyses with Q-Q 
plots of nominal and permuted P values (permutation of the patients 
inthe gene expression matrix) (Supplementary Fig. 1Ob-d). Moreover, 
for the linear regression analysis, genotypes were standardized across 
patients (to mean zero and standard deviation one) and standardized 
effect sizes are provided in Supplementary Table 5. 

Overall, somatic burden within flanking regions was the most preva- 
lent type of burden tested per gene (Extended Data Fig. 6a). We found 
similar average relative mutation density per type of genomic region 
(flanking = 0.008 mutations per kb; introns = 0.007 mutations per kb; 
exons = 0.006 mutations per kb) (Extended Data Fig. 6b) and average 
recurrence of the same mutated region across the cohort was rather 
low (flanking = 1.4%; exons = 1.7%; introns = 4%) (Extended Data Fig. 6c). 


Somatic eQTL analysis 

Linear models were used to model the correlation between recurrent 
somatic burden and gene expression of up to 18,898 protein-coding 
genes, using the limix package®' (see ‘Gene expression filtering’). Gene 
expression was corrected for 35 hidden Peer factors. Known covariates 
were modelled as fixed effects (see ‘Covariates’). We considered only 
somatic burdens with frequency greater than 1%, including exonic 
and intronic burdens, as well as flanking burdens, within 1 Mb from 
gene boundaries. 

The somatic eQTL analysis was performed on all 1,188 patients and 
on the subset of 899 patients with carcinoma (representing 20 of the 
27 types of cancer) to replicate the analysis on a more homogeneous 
set of tumours. A cis window of 1 Mb from the gene boundaries was 
used to find mutated genomic intervals with a burden frequency = 1% 
in the cohort (at least 12 patients in the full cohort and 9 patients in 


the carcinoma cohort). Together, 18,708 of the genes had at least one 
mutated interval at that frequency and were included in the analysis 
and 1,049,102 regions showed a burden frequency > 1% 

Bonferroni correction was applied to correct for multiple cis windows 
tested within the same gene. Then, Benjamini-Hochberg correction was 
applied to adjust the Pvalues of the lead genomic regions across genes. 
Somatic eGenes were defined as genes with an eQTL at aFDR<5%. 


Somatic cis-eQTL comparative analysis 

We compared our 649 somatic eQTL set with three previous cancer 
studies®** ** to identify independent evidence of interaction between 
our eGenes and the associated cis-genomic regions with somatic bur- 
den. Studies were chosen if they provided lists of cancer regulatory 
elements linked to genes or regulatory elements with somatic muta- 
tions linked to gene expression deregulation in cancer. All the three 
studies examined were based on TCGA cancers. For this, we checked 
perfect overlaps with both the somatic burden location and the eGene. 
Moreover, we looked at the overlap between somatic eQTL and 72,987 
GeneHancer®’ enhancers-to-genes interactions, with at least two inde- 
pendent supporting methods (called ‘double-elite’), downloaded from 
the UCSC hg19 GeneHancer track”®. We then compared this overlap 
with a set of nulls generated by 1,000 random permutations of the 
GeneHancer regulatory elements with nearby genes located within 
1Mb. We then retrieved an empirical P value of enrichment by counting 
the number of random nulls (N) showing greater number of overlaps 
than those found between the somatic eQTL set and the GeneHancer 
set (P=(N+1)/(1,000 +1)). 


Functional enrichment in somatic cis-eQTL 

To identify putative regulatory sites enriched for somatic eQTL, we 
retrieved functional annotations of the lead genomic flanking inter- 
vals of the somatic eQTL (556 intervals linked to 638 somatic eQTL). 
Therefore, we mapped somatic eQTL to 25 Roadmap Epigenomics 
chromatin marks of 127 different cell types°and ENCODE transcription- 
factor binding site annotations in 9 cell types (including 8 cancer and 
one embryonic stem-cell lines”) (Supplementary Tables 6 and 7). We 
compared annotations inthe significant set of eQTLs with a null distri- 
bution based 0n1,000 random samplings of a matched set of genomic 
intervals. To define the matched sets of genomic intervals, we selected 
flanking genomic intervals from the whole set of tested genes that 
showed a similar distance from the gene start (exact distance + 2 kb) 
and that matched the exact burden frequency of the corresponding 
interval in the significant associations. We then overlapped the 1,000 
matched sets with Roadmap Epigenomics and ENCODE annotations. To 
avoid ambiguous overlaps (with multiple annotations), we retained only 
genomic intervals showing a minimum overlap of 10% of their length. 

We retrieved an empirical P value of enrichment for each annota- 
tion by counting the number of randomly sampled flanking intervals 
(N) showing greater number of overlaps compared to the eQTL set 
(P=(N+1)/(1,000 + 1)). Benjamini-Hochberg correction was applied 
to the empirical P values (over 25 marks in 127 cell lines for Roadmap 
Epigenomics annotations and over 149 transcription-factor-binding 
sites for 9 ENCODE cell lines). We then computed the fold change per 
annotation and cell line as a ratio of annotated lead flanking intervals 
and mean number of annotated matched random flanking intervals 
over the 1,000 samplings. 

Furthermore, we performed GO” and Reactome pathway?” enrich- 
ment with the Bioconductor packages biomaRt””, clusterProfiler’® and 
ReactomePA” (FDR<10%) and also looked at enrichment within high- 
confidence cancer testis genes previously described”, using 18,708 
genes with at least one mutated interval as background. 


Variance component analysis 
Limix was used to perform variance decomposition using the 
same covariates as in the somatic variant analyses except for local 
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copy-number state (see ‘Covariates’). The random effects were based 
onthe following common germline variants and somatic burden (fre- 
quency > 1%) (see ‘Somatic calls and mutational burden’ for detailed 
description of burden): (1) cis-somatic intronic: weighted burden 
in introns; (2) cis-somatic exonic: weighted burden in exons; (3) cis- 
somatic flanking: weighted burden in1-kb-overlapping regions of 2 kb 
within 1 Mb from gene boundaries; (4) somatic intergenic: weighted 
burden in 1-kb-overlapping regions of 2 kb outside the 1 Mb window; 
(5) cis-germline: germline variants within 100 kb from gene boundaries; 
(6) trans-germline: genome-wide population structure (see ‘Covari- 
ates’); and (7) local copy-number variation (see ‘Covariates’). 

All the data was mean-centred and standardized. For each of the 
random effects, a linear kernel was computed and used as covariance 
matrix. The resulting variance components were normalized to add 
up to one. 


Mutational signature associations 

We obtained 39 mutational signatures from PCAWG-7 beta 2 release” 
and used linear models to associate the mutational signatures with gene 
expression of up to 18,898 protein-coding genes across 1,159 patients 
while accounting for known covariates (see ‘Covariates’) (quality con- 
trol) (Extended Data Fig. 10a—e). The 1,159 patients were a subset of 
the total 1,188 patients, for whom mutational signature profiles were 
available. Gene expression was corrected for 35 hidden peer factors 
(see ‘Gene expression filtering’). 

We retained 18,888 genes that showed a minimum FPKM of 0.1in at 
least 1% of 1,159 the patients (see ‘Gene expression filtering’). Signatures 
with zero variance and a prevalence below 1% were filtered, and we 
obtained 28 signatures. We applied linear models to associate expres- 
sion of these genes with the signatures across all 1,159 patients, asubset 
of 877 patients with carcinoma or a subset of 891 European patients 
to assess consistency of the associations (Extended Data Fig. 10f, g). 

Across all patients, we found 1,176 significantly associated genes 
after Benjamini-Hochberg correction (we used an FDR <10% for enrich- 
ment analyses, multiple testing was applied across all signature-gene 
pairs) (Supplementary Tables 19a—c). We performed gene enrichment 
analyses of the significant genes per signature (see ‘GO and Reactome 
pathway enrichment’) (here 18,831 background genes, multiple testing 
correction across all ontologies per signature FDR < 10%) (Supplemen- 
tary Table 19d). Whereas most signatures were associated with only few 
genes, 18 showed recurrent trans effects and affected expression of 
over 20 genes (Extended Data Fig. 11d, Supplementary Table 19e). We 
further found that the vast majority of genes (85.8%) were associated 
with only one signature (1,009 genes); 129 genes were associated with 
two, 32 with three, 5 with four and 1 with five signatures. 

To assess how tissue-specific both mutational signatures and their 
associations with gene expression are, we analysed the occurrence of 
each signature in each of the types of cancer. We assessed the presence 
(at least one SNV of asignature in at least one patient witha specific can- 
cer type) and mean prevalence (mean number of SNVs of a certain sig- 
nature across all patients of a specific cancer type) of the signaturesin 
the types of cancer (Extended Data Fig. 13c, d). We defined cancer-type- 
specific signatures to occur in up to four types of cancer (signatures 4, 
7,9, 12,16, 38 and 39) and common signatures to be missing in up to five 
types of cancer (signatures 2, 13 and 18). For each of these signatures, 
we performed cancer-type-specific analyses, that is, we assessed the 
association between the respective signature and gene expression in 
just the patients who are of acancer type that shows mutations of the 
respective signature (Extended Data Fig. 13c, left heat map). We then 
correlated the Pvalues of these cancer-type-specific analyses with the 
P values of the analysis across all patients and calculated the Pearson 
correlation coefficients (Supplementary Fig. 24a-e). We show that the 
correlation between cancer-type-specific and whole-cohort P values 
is dependent on the sample size of the respective analysis (7? = 0.671) 
(Supplementary Fig. 1f). 


We further performed PCA on the signatures across both, patients 
(PCA on signature-specific SNVs per patient) and genes (PCA on 
adjusted Pvalues of signature-gene expression associations) (Extended 
Data Fig. 11a, b). 

Toassess significance of the functional annotation of SNVs by muta- 
tional signatures, we also associated gene expression with the total 
number of SNVs and correlated the P values (—log,,(P)) of the asso- 
ciations with the respective signature-specific P values. The absolute 
Pearson correlation coefficients remain below 0.1 (Supplementary 
Table 19f). 

To establish causality of signature-gene expression associations, we 
included the germline eQTL into the analysis using linear mixed models; 
197 of our 1,176 signature-associated genes were also germline eGenes. 
These 197 associations involved 26 of the 28 mutational signatures. We 
associated the lead variants of these eGenes with the rank-standardized 
signature SNVs across 2,507 patients. We used the subset of the 2,818 
WGS patients for which mutational signature profiles and all known 
covariates were available. We accounted for the same fixed covariates 
as inthe mutational signature-gene expression association studies and, 
in addition, for kinship as a random effect (see ‘Covariates’). 

We then performed proportional colocalization analysis with Bayes- 
ian model averaging using the R package coloc” to test whether gene 
expression and mutational signatures share common causal genetic 
variants in a given gene region. A proportional colocalization analy- 
sis tests the null hypothesis of colocalization by assuming that two 
phenotypes that share causal variants will have proportional regres- 
sion coefficients for either phenotype with any variant selection inthe 
vicinity of the causal variant. We applied the Bayesian model averag- 
ing approach, with each tested model consisting of a selection of two 
variants. The P values are then averaged over all models to generate 
posterior predictive P values”. We filtered variants so that no pair of 
variants showed r’ > 0.95 and each variant’s marginal posterior prob- 
ability of inclusion with one of the phenotypes was greater than 0.01. 
The nominal Pvalues of rejecting the null hypothesis of colocalization 
are listed in Supplementary Table 19e. 

We then performed mediation analysis” to assess directionality 
of the effect between germline eQTL, gene expression and mutational 
signature. First, causal mediation analysis was applied to each of the 
triples of eQTL lead variant, gene and mutational signature using a 
structural equation model from the R package lavaan”®. Then, we used 
the R package mediation” to assess significance of mediation and esti- 
mate the proportion of mediated effect by non-parametric bootstrap 
confidence intervals (1,000 simulations). 


ASE analysis: assembling phased germline and somatic variants 
To understand the precise effect of somatic variations in their genomic 
context and for subsequent allele-specific analyses, both germline and 
somatic variants were phased. For assembling phased germline geno- 
types, we used the Sanger 1000G callset®, and applied IMPUTE2” for 
phasing of heterozygous germline variants. The IMPUTE2 output was 
corrected using results from the Battenberg CN calling algorithm” to 
ascertain that no haplotype switches occur within regions of consecu- 
tive copy-number gain. The resulting phased germline genotypes were 
arranged such that haplotype 1 always corresponded to the amplified 
alleles in regions with SCNAs (major allele). In cases in which both co- 
occur onthe same NGS read (approximately 10 million variants, 20% of 
all SNVs), we phased individual somatic variants to the nearest germline 
heterozygous site. For downstream analyses, we considered only SNVs 
that were phased by at least three reads to the respective germline 
variant (approximately 6 million out of 10 million SNVs). 

All phased SNVs were aggregated into functional categories based 
on their genomic regions defined by gene annotations (upstream, 
downstream, promoter, 5’ UTR, intron, synonymous, missense, stop 
gain and 3’ UTR) and mapped to the nearest gene within a cis window 
of 100 kb using the Variant Effect Predictor (VEP) tool!©°. Promoter 


variants were defined as 1-kb upstream of the TSS. We included flanking 
regions by using the VEP ‘UpDownDistance’ plugin with a maximum 
range parameter of 100 kb. We divided the upstream and downstream 
variant categories into disjoint categories using 10-kb windows from 
10 to 100 kb. We integrated ‘splice donor’ and ‘splice acceptor’ vari- 
ants into the general ‘splice region’ variant category and mapped ‘stop 
retained’ variants to the ‘synonymous variant category. We averaged 
transcript-level annotations to gene-level annotations to retrieve the 
expected functional effect of a variant for a given gene. We analysed 
the relationship between SNV variant allele frequency and SCNAs at 
the same locus to determine whether variants occurred before (‘early’) 
or after (‘late’) the corresponding SCNA (PCAWG-11). We computed a 
weighted cis-mutational burden per category by estimating the can- 
cer cell fraction of each SNV and aggregating SNVs to a total localized 
burden weighted by their respective cancer cell fraction. 


ASE read counts 

The positional information of the heterozygous germline variants 
was used together with the RNA-seq BAM files as input to the GATK 
ASEReadCounter™ algorithms for counting ASE reads. We considered 
reads witha minimum mapping quality of 20 and a minimum base qual- 
ity of 10. Only heterozygous variants with a minimum coverage of eight 
RNA-seq reads were considered for all further analyses. 

The raw ASE read counts were post-processed as follows: (1) ASE sites 
were converted to BED files and aligned against the ENCODE 50-mer 
mappability track (wgEncodeCrgMapabilityAlignSOmer.bigWig) to 
extract mappability scores for all sites. All sites with mappability scores 
unequal to 1 were removed. (2) All sites with allelic read counts less or 
equal to 1 were removed to prevent genotyping error to influence ASE 
quantification. (3) All sex chromosomes were dropped for further 
analysis. (4) We estimated sequencing error per patient as the sum 
of non-reference and non-alternative bases over the total number of 
bases. We assessed statistical mono-allelicity through a binomial test 
using the estimated sequencing error probabilities, corrected using the 
Benjamini-Hochberg step down procedure. All sites that appeared to 
be statistically mono-allelic were removed. (5) For each ASE site, copy- 
number states were retrieved from the Sanger copy-number consensus 
callset (PCAWG-11). Purity estimates for each patients were retrieved 
from the accompanying purity tables. 

To aggregate site-level ASE to a gene-level readout and to allow for 
estimation of effect directionality, we used the phased germline geno- 
types. Gene mapping was performed against ENSEMBL release 75 using 
the pyEnsembI Python library. We retrieved all genes at each ASE site 
and summed up the read counts on the respective haplotypes to gene- 
level haplotype-specific read counts. We further averaged haplotype- 
specific copy-number states toa mean haplotype-specific copy-number 
state per gene and computed the gene-level copy-number ratio as the 
major over total ratio of those averages. To allow for a robust assess- 
ment of gene-level ASE, we considered only genes with at least 15 reads 
total, yielding 4,379,378 gene-patient pairs of 1,120 patients and 17,009 
unique genes across 12,441,502 accessible sites in total. Every remain- 
ing gene was tested for AEI using a binomial test against an expected 
read ratio of 0.5 to derive nominal P values, and a binomial test against 
the expected copy-number ratio modified by tumour purity to derive 
copy-number-corrected Pvalues. Nominal and copy-number-corrected 
P values were adjusted separately for multiple testing using the Ben- 
jamini-Hochberg procedure. Significant AEI was called at FDR < 5%. 
We further annotated each gene with the number of ASE sites used for 
aggregation. For all downstream analyses, we considered only genes 
annotated as protein coding (ENSEMBL biotype =‘protein_coding’). 


Generalized linear models 

Across all 4,379,378 gene-patient pairs, we trained multivariate linear 
models using (i) logistic regression against a binary indicator of AEI 
absence or presence ina gene, or (ii) standard linear regression against 


the phased ASE ratio of a gene to assess the directionality of the regula- 
tory change. For (i), haplotype-specific mutations were summed up to 
a total burden per category, whereas for (ii) we used the difference in 
burden between the haplotypes 1 and 2. The consistency of the phas- 
ing map between somatic variants and ASE sites ensured that model 
coefficients kept their directionality independent of the arbitrary 
labelling of haplotypes as 1 or 2. The full set of considered factors is as 
follows: (1) copy-number ratio at the gene locus (0.5<x <1); (2) sample 
purity (O <x <1); (3) natural logarithm of total gene length (x > 0); (4) 
natural logarithm of the length of the canonical transcript (x > 0); (5) 
heterozygosity of the lead eQTL variant (x= 0 if homozygous, x=1ifnot 
homozygous); (6) all mutational burden categories as determined by 
VEP annotations (upstream in 10-kb windows, downstream in 10-kb win- 
dows, promoter, 5’ UTR, intron, synonymous, missense, stop gain and 
3’ UTR; x= 0 for logistic model, x € R for directed model). 

To compare global effects and different contributions of SCNA, ger- 
mline eQTL, coding and non-coding SNVs, a simplified logistic model 
was trained after accumulating all coding and non-coding variants to 
separate categories and reporting standardised effect sizes (Fig. le). 


Cancer gene enrichment 

Cancer gene enrichment was conducted on the COSMIC census” 
using Fisher’s exact test and gene set enrichment analysis as previ- 
ously described’”. For enrichment, the average score of a gene was 
computed across the cohort and only genes with at least five replicates 
inthe cohort were kept, yielding a total of 16,078 genes. 


Chromosomal distribution of ASE 

We calculated the recurrence of ASE genes in each tumour type. To 
examine the chromosomal distribution of ASE genes, we calculated 
the average recurrence of all genes for every 200-gene window with 
a10-gene step, and then subtracted the average ASE occurrence in 
each tumour type to obtain the peaks of ASE surplus across all chro- 
mosomes. The recurrence of copy-number genes was calculated inan 
analogous manner. 


Estimation of alternative promoter activity 

We estimated promoter activities using RNA-seq data and Gencode 
(release 19) annotations for 70,937 promoters in 20,738 genes. We 
grouped transcripts with overlapping first exons under the assumption 
that they are regulated by the same promoter’. TSSs that are located 
within internal exons, or which overlap with splice acceptor sites, were 
removed from this analysis as these promoters are difficult to estimate 
from RNA-seq data”®. Promoter activity can be estimated using exon 
usage”’, spliced reads”’ or isoform-based estimates*°. Here we used an 
isoform-based approach to quantify promoter activity. We quantified 
the expression of each transcript from the RNA-seq data using Kallisto® 
and calculated the sum of expression of the transcripts initiated at 
each promoter to obtain an estimate of promoter activity. To obtain 
the relative activity for each promoter, we normalized each promoter’s 
activity by the overall gene’s expression. We divided the promoters 
of each gene into three categories based on their average pan-cancer 
promoter activity. The promoters with <1 FPKM average activity are 
called inactive promoters, and the most active promoter of each gene 
is called the major promoter. The remaining active promoters of the 
gene are called minor promoters. 

The association between promoter activities and promoter mutation 
burden was estimated using the same framework as the somatic eQTL 
analysis. We examined associations for the promoters of expressed 
multi-promoter genes with a burden frequency = 1% in the cohort (at 
least 12 patients in the full cohort). The weighted burden of the region 
1-kb upstream of the TSS—that is, the sum of variant allele frequencies 
of the SNVs for each gene—was used as the genotype for the promoters 
of the respective genes. We used linear models to study the associations 
between the recurrent somatic burden and the promoter activity (both 
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for the relative activity and the log,-transformed absolute activity). 
Similar to the somatic eQTL analysis, the known covariates and the 35 
hidden peer factors were provided as cofactors to the linear models. We 
adjusted the P values using Benjamini-Hochberg correction method 
and looked for associations with FDR < 5%. 


Identification of alternative splicing 

We used the alignments based on the STAR pipeline to collect and quan- 
tify alternative splicing events with SplAdder”. The software has been 
run with its default parameters with confidence level 3. We generated 
individual splicing graphs for each RNA-seq sample for both tumour 
samples as well as matched healthy samples (when available). All graphs 
were then integrated into a merged graph to comprehensively reflect 
all splice junctions observed in all samples together. On the basis of 
this combined graph, SplAdder was used to extract alternative splicing 
events of the following types: alternative 3’ splice site, alternative 5’ 
splice site, cassette exon, intron retention, mutually exclusive exons, 
coordinated exon skip (see supplementary figure 3 in ref. ”°). Each 
identified event was then quantified in all samples by counting split 
alignments for each splice junction in any previously identified event 
and the average read coverage of each exonic segment involved inthe 
event was determined. We then computed a PSI value for each event 
that was then used for further analysis. We further generated different 
subsets of events, filtered at different levels of confidence, in which 
confidence is defined by the SplAdder confidence level (generally 2), 
the number of aligned reads supporting each event, the number of 
samples that were found to support the event by SplAdder, and the 
number of samples that passed the minimum aligned read threshold. 


Enrichment of outlier splicing associated with splice sites and 
branchpoint motifs 

We assessed the significance of mutational enrichment for 5’ and 3’ 
splice sites, and branch-point’™ intronic regions using a permutation- 
based approach. Impactful mutations were defined as mutations over- 
lapping exons and introns involved in cassette exon events, in which the 
PSI-derived z-score was > 3 or <—3. For each intronic site, we compared 
the frequency of observed impactful mutations against frequencies of 
randomly sampled intronic regions (number of iterations = 1,000). For 
exonic sites, the null distribution was established from randomly sam- 
pled exonic sites. Randomly sampled sites were within a100-bp window 
around the5’ and 3’ splice site. For branch-point regions, sampled sites 
were within a 50-bp window around the branch-point sequence. The 
Pvalue was computed as the number of randomly sampled frequencies 
greater or equal to the observed frequency. 


SAVNet analysis for identifying rare SAVs 

The SAVNet approach® was designed for identifying somatic vari- 
ants associated with local aberrant splicing alterations from matched 
genome and transcriptome sequencing data. It uses permutations 
to calculate an FDR and by restricting to two classes of relationships 
between somatic mutations and splicing alterations to focus: (1) splice 
site disruption, in which exon skipping, alternative 5’ or 3’ splice site, 
or intron retention is associated with a mutation ina splice site motif; 
and (2) splice site creation, in which alternative 5’ or 3’ splice sites are 
associated with mutations that create a novel splice motif (FDR < 10%) 
(Extended Data Fig. 17e). 


Identification of RNA fusions 

Gene fusions between any two genes were identified based on two gene 
fusions detection pipelines: FusionMap (v.2015-03-31) pipeline’” and 
FusionCatcher (v.0.99.6a)/STAR-Fusion (v.0.8.0) pipeline’. ChimerDB 
3.0 was used as a reference of previously reported gene fusions. The 
database contains 32,949 fusion genes split into three groups: (1) KB: 
1,067 fusion genes manually curated based on public resources of 
fusion genes with experimental evidences; (2) Pub: 2,770 fusion genes 


obtained from text mining of PubMed abstracts; and (3) Seq: archive 
with 30,001 fusion gene candidates from deep-sequencing data. This 
set includes fusions found by re-analysing the RNA-seq data of the 
TCGA project encompassing 4,569 patients from 23 types of cancer. 

In brief, FusionMap was applied to all unaligned reads from the 
PCAWG aligned TopHat2 RNA-seq BAM files for each aliquot to detect 
gene fusions. In the FusionCatcher/STAR-Fusion pipeline, for each ali- 
quot with paired-end RNA-seq reads FusionCatcher was applied tothe 
raw reads, withthe genome reference. Specifically, for each aliquot with 
paired-end RNA-seq reads FusionCatcher was applied to the raw reads. 
The‘-U True; -V True’ runtime options were used. For each aliquot with 
single-end RNA-seq reads, STAR-Fusion was applied to the raw reads, 
with the same reference genome and gene models as FusionCatcher and 
with default settings. In parallel, FusionMap was applied to all unaligned 
reads from the PCAWG aligned TopHat2 RNA-seq BAM files for each 
aliquot to detect gene fusions with the following non-default options 
values: MinimalHit = 4; OutputFusionReads = True; RnaMode = True; 
FileFormat = BAM. 

To reduce the number of false-positive fusions, the two sets of fusions 
were filtered to exclude fusions based on the number of supporting 
junction reads, sequence homology, and occurrence in normal samples 
(from the GTEx and the PCAWG cohort). To get a high-confident con- 
sensus fusion call set from these two pipelines, a fusion to be included 
inthe final set of fusions had to: (i) be detected by both fusion detection 
toolsin atleast one sample; and/or (ii) be detected by one of the meth- 
ods and have a matched structural variant in at least one sample. The 
consensus WGS-based somatic structural variants (v.1.6) were obtained 
fromthe PCAWG repository in https://dcc.icgc.org/releases/PCAWG. 

For integration with matched structural variant evidence, a fusion 
was considered to match a structural variant if the absolute distance 
between the fusion break points and structural variant break points 
did not exceed 500 kb (the distance was considered infinite when the 
chromosomes of the fusion and structural variant break point differ). 
When there was no evidence for a direct structural variant fusion, the 
search was expanded to look for composite fusions. In this case, an 
exhaustive search was performed to look for two structural variants 
with break points close to the fusion break points and with an effective 
distance smaller than 250 kb. 

Finally, 3,540 fusion events were included as the consensus fusion 
call set, from these 2,268 were detected by both FusionCatcher/STAR- 
Fusion and FusionMap (from these, 1,821 had matched structural variant 
evidence) and 1,112 were detected by only one method and had matched 
structural variance evidence. 

In total, approximately 36% of all detected fusion transcripts were 
predicted to bein-frame, several UTR-mediated fusion transcripts pre- 
serve complete coding sequences of one fusion partner. These include 
a known fusion 7BL1XR1-P/K3CA ina breast tumour and a notable new 
example C7BP2-CTNNB1 in a gastric tumour. 

All fusions are available in Synapse: https://dcc.icgc.org/releases/ 
PCAWG/transcriptome/fusion. 


Identification of RNA-editing events 

We used an RNA-editing events calling pipeline, whichis animproved 
version of that previously published’. First, we summarized the base 
calls of pre-processed aligned RNA reads to the human reference in 
pileup format. Second, the initially identified editing sites were then 
filtered by the following quality-aware steps: (1) the depth of candidate 
editing site, base quality, mapping quality and the frequency of varia- 
tion were taken into account to doa basic filter: the candidate variant 
sites should be with base-quality > 20, mapping quality >50, mapped 
reads > 4, variant-supporting reads > 3, and mismatch frequencies 
(variant-supporting-reads/mapped-reads) > 0.1. (2) Statistical tests 
based on the binomial distribution B(n, p) were used to distinguish 
true variants from sequencing errors on every mismatch site’, in 
which p denotes the background mismatch rate of each transcriptome 


sequencing, and n denotes sequencing depth on this site. (3) Discard 
the sites present in combined DNA SNP datasets (dbSNP v.138, 1000 
Genome SNP phase 3, human Dutch populations™°, and BGl in-house 
data; combined datasets deposited at: ftp://ftp.genomics.org.cn/ 
pub/icgc-pcawg3). (4) Estimate strand bias and filter out variants with 
strand bias based on two-tailed Fisher’s exact test. (5) Estimate and 
filter out variants with position bias, such as sites only found at the 3’ 
end or at5’end of aread. (6) Discard the variation site in simple repeat 
region or homopolymer region or <5 bp from splicing site. (7) To reduce 
false positives introduced by misalignment of reads to highly similar 
regions of the reference genome, we performed a realignment filter- 
ing. Specifically, we extracted variant-supporting reads on candidate 
variant sites and realign them against a combination reference (hg19 
genome plus EnsembI transcript reference v.75) by bwa0.5.9-r16. We 
retain a candidate variant site if at least 90% of its variant-supporting 
reads are realigned to this site. Finally, all high confident RNA-editing 
sites were annotated by ANNOVAR"™. (8) To remove the possibility of 
an RNA-editing variant being a somatic variant, the variant sites are 
positionally filtered against PCAWG WGS somatic variant calls (9). 
The final two steps of filtering are designed to enrich the number of 
functional RNA editing sites. First, we keep only events that occur 
more than two times in at least one cancer type. Second, we keep 
only events that occur in exonic regions with a predicted function 
of missense, nonsense or stop-loss. The final step of filtering within 
exonic regions with a specific predicted function induces the largest 
difference in observed frequencies of RNA-editing events between 
our analysis and the published one!°’. A comparative depiction of 
the frequencies of RNA-editing events identified in our analysis (Sup- 
plementary Table 24) and the previously published analysis’ is seen 
in Supplementary Fig. 23. 


Gene-centric table creation 

To perform joint analysis across RNA and DNA alterations, each altera- 
tion type was condensed into a binary gene-centric format. Because 
alterations occur at many different scales (nucleotide, exonic, gene or 
transcript), to make them comparable we projected each alteration 
type onto the gene body. We summarized each alteration type by its 
presence or absence within a single gene, yielding a binary value per 
type for each gene-sample pair. 

The events we included in this analysis were: RNA editing, non- 
synonymous variants, expression, splicing alterations, copy-number 
alterations, fusions and alternative promoters. Each alteration type was 
summarized differently owing to their inherent differences. 

RNA-editing events and non-synonymous variants can occur several 
times within a single gene body, so these events were denoted as 1if 
they occurred at least once within a gene-sample pair. 

For copy number, to obtain a single numerical value per gene-sample 
pair, the copy-number alteration was averaged over the gene body. 
Because we do not have matched normal samples against which to 
compare, we instead consider outlying events within each histotype 
as significant. Thus, a value of 1 was given to average copy-number 
alterations larger than 6 or smaller than 1. 

Similar to non-synonymous variants, multiple splice events can 
occur within a gene body. The event with the most extreme PSI value 
within the gene body is selected as the candidate event for the gene. 
The candidate’s PSI value for a gene is compared over all samples within 
a histotype and it is set to 1 (that is, significant) only if it the absolute 
value of its z-score is larger than 6 and the standard deviation is larger 
than 0.01 within that histotype. 

Similar to expression outliers, we calculate a z-score using the log- 
transformed upper-quartile normalized FPKM values with a pseudo- 
count of 1. All genes within a histotype with a standard deviation larger 
than zero and an absolute value larger than three were identified as an 
outlier. Alternative promoter outliers were calculated based on relative 
promoter activity within each cancer type. To binarize the promoter 


activity, a z-score cut-off of two over the relative expression distribu- 
tion within each cancer type was used. 

For ASE outliers, only genes with significant allelic imbalance 
(FDR < 5% and allelic imbalance > 0.2, binomial test) were denoted as 
1. All ASE events that were identified were further filtered to keep only 
genes that have not been identified as imprinted”®. 

In addition to the z-score-filtering mentioned above, we further fil- 
tered non-synonymous SNVs, RNA-editing events and splicing events 
such that they either induce a frameshift or the alternative region con- 
tains an HGMD variant” of the category ‘damaging’. 

It must be noted that in many cases, the z-score calculated is not 
froma Gaussian distribution, so some events may be missed or falsely 
included. Through our choice of very stringent z-score thresholds and 
functional filters, we hope that spurious outlier events are minimized. 


Pathway analysis 

For our pathway analysis, we used the TCGA pathway definitions to 
examine genes and pathways that have several alterations at both the 
DNA and RNA level”. 


Co-occurrence analysis 

The co-occurrence analysis was also performed onthe aforementioned 
binarized gene-centric table, but only including variants, expression 
outliers, alternative promoters, alternative splicing and fusions. 
SCNA and ASE are excluded owing to a large number of anticipated 
co-occurrence. In this analysis, we required at least one gene of agiven 
alteration pair to bea COSMIC gene. For each alteration pair, based on 
the number of donors with both alterations, one alteration only and 
neither alterations in a set of cancer samples, we performed Fisher’s 
exact test to determine whether the alteration pair was independent 
of each other. Such tests were followed by Benjamini-Hochberg mul- 
tiple testing correction to obtain the FDR (or q values). To rule out the 
potential false-positive association caused by tissue-specific altera- 
tions, we performed the same analysis for each of the tumour types 
with at least 50 patients, and retained only those alteration pairs that 
were significantly associated in both the pan-cancer analysis and in at 
least one specific cancer indication. Among the significantly associ- 
ated alteration pairs, the co-occurred pairs were those with odds ratio 
greater than 1. Pathway enrichment and visualization””"* were con- 
ducted using the R package ReactomePA”. The circos plots were gen- 
erated using the R package circlize"’. The splicing related genes were 
derived from the genes annotated as ‘REACTOME_MRNA SPLICING’ 
or‘REACTOME_MRNA_SPLICING_MINOR_PATHWAY in the Molecular 
Signatures Database (MSigDB)”*. 


Identifying genes with heterogeneous mechanisms of 
alterations in cis 

Genes with multiple heterogeneous mechanisms of RNA alteration 
were identified from associations of cis variants with gene expression, 
ASE, fusions and splicing. For gene expression, genes associated with 
somatic eQTL with FDR <5% were selected. For ASE, the top 5% of genes 
ranked by the predicted contribution of somatic variants on ASE. For 
fusions, all RNA fusions with structural variant support were selected. 
For splicing, genes having somatic mutations within 10 bp of an anno- 
tated splice site or 3 bp ofa branch point and associated splicing were 
selected. These associated splicing events also had to have a |z-score| 
greater than or equal to 3 and the difference of percent spliced in the 
outlier event was greater than or equal to 10%. 


Recurrence analysis 

The recurrence analysis was performed on the binarized gene-centric 
table for all nine alteration types. The recurrence analysis was per- 
formed in three main steps: (1) Aggregate within each alteration type 
across all samples. This results in a sum for each gene-alteration pair. 
(2) Convert the counts to ranks within each alteration. The smallest rank 
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goes tothe most frequently altered genes. Ranks are split evenly across 
ties. (3) To generate a single score for each gene, the second smallest 
rank across alterations is used as the score. To identify a score cut-off 
value for significantly altered genes, a null distribution was generated 
through permutation. The permutations were performed over the 
samples within each gene-alteration pair, this was done over all genes 
and samples 1,000 times, concatenating together all observations, 
results in 16.8 million permuted scores. P< 0.05 as derived from the 
null distribution was defined as significant, resulting in a score greater 
than or equal to 774 considered as significant. 

WExT”” was used to test the significance of mutually exclusivity 
of RNA and DNA alterations. As further evidence that CDK12 altera- 
tions may have a functional affect, we find evidence of the previously 
detected link® between a large tandem duplicator phenotype (here 
defined as more than 10 tandem duplications of size greater than 100 
kb) and CDK12 somatic eQTL mutation (7 out of 18 somatic eQTL car- 
riers are also among the 215 large tandem duplicator cases, P=0.032, 
hypergeometric test). 


Statistical tests 

Allcommon statistical tests are two-sided unless otherwise specified. 
No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Somatic and germline variant calls, mutational signatures, subclonal 
reconstructions, and other core data generated by the ICGC and TCGA 
PCAWG Consortium are described in an accompanying Article’ and 
are available for download at https://dcc.icgc.org/releases/PCAWG. 
Additional information on accessing the data, including raw read files, 
can be found at https://docs.icgc.org/pcawg/data/. In accordance with 
the data access policies of the ICGC and TCGA projects, most molecu- 
lar, clinical and specimen data are in an open tier that does not require 
access approval. To access potentially identification information, such 
as germline alleles and underlying sequencing data, researchers will 
need to apply to the TCGA data access committee via dbGaP (https:// 
dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?page=login) for access to the TCGA 
portion of the dataset, and to the ICGC data access compliance office 
(http://icgc.org/daco) for the ICGC portion of the dataset. In addition, 
to access somatic SNVs derived from TCGA donors, researchers will 
also need to obtain dbGaP authorization. Data derived specifically from 
RNA-seq analysis can be found at https://dcc.icgc.org/releases/PCAWG/ 
transcriptome. Subfolders contain identification and quantification 
of alternative promoter usage, alternative splicing, RNA fusions, gene 
expression, transcript-level expression and RNA editing. Identified eQTLs 
areinhttps://dcc.icgc.org/releases/PCAWG/transcriptome/eQTL anda 
binarized table indicating all RNA and DNA alterations for each gene can 
be found in the subfolder https://dcc.icgc.org/releases/PCAWG/tran- 
scriptome/recurrence_analyses/.In addition, quality-control metrics and 
metadata are also included. Some datasets are denoted with synXXXXX 
accession numbers and available at Synapse (https://www.synapse.org/). 


Code availability 


The core computational pipelines used by the PCAWG Consortium for 
alignment, quality control and variant calling are available to the public 
at https://dockstore.org/search?search=pcawg under the GNU General 
Public License v.3.0, which allows for reuse and distribution. Further 
details on code availability are in the Supplementary Information. 
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Extended Data Fig. 1| Pan-cancer expression profiling of 1,188 PCAWG 


donors. a, Tumour and normal RNA-seq data from 27 histotypes. The total 


Donors 


number of samples is shown to the right of the bars. Grey bars denote matched 
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healthy samples. b, Number of female versus male donors. c, Total number of 
tumour and matched healthy samples from the PCAWG study. A subset of 


tumours (dark violet) was metastatic. 
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Extended Data Fig. 2| Overview of the different sources of genetic variation 
considered in the analysis. a, For analyses of cis regulation, mono-allelic 
single-nucleotide germline variants (single nucleotide polymorphisms (SNPs), 
blue) were individually tested for association with total gene expression using 
standard eQTL approaches. Owing to their lowrecurrence in the cohort, 
somatic SNVs were aggregated in burden categories depending on their 
position relative to the gene tested (for example, promoter, 5’ UTR or intron). 
Local SNV burdens were then tested for association with ASE globally across all 
genes, as well as with total expression ona per-gene level using eQTL 
approaches. Trans effects were estimated by testing total gene expression for 
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association with mutational and epigenetic signatures. Window sizes were 1Mb 
for all somatic cis-eQTL analyses, and 100 kb for ASE and germline cis-eQTL. 

b, Overview of the different datasets and their contributions to the analyses 
described ina. Germline genotypes were derived from the matched healthy 
whole-genome sequencing (WGS) samples. Allele-specific SCNAs, mutational 
signatures and local SNV burdens were derived from the tumour WGS in 
comparison tothe unaffected WGS samples. ASE and total expression (FPKM) 
were derived from the tumour and normal RNA-seq data. Arrows indicate 
dependencies between individual analyses carried out. 
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histotype-specific analysis (FDR <5%, blue) and Pvalues of the same analysis variants in the pan-cancer and histotype-specific analysis. 
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Extended Data Fig. 4 | PCAWG-specific eGenes. a, Number of PCAWG-specific eGenes in relation to eQTL replication in various numbers of GTEx tissues. 


b, Number of eGenes of the PCAWG pan-analysis replicating in corresponding GTEx tissues. 
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of all genomic regions tested (n=1,049,102 with burden frequency > 1%) and of 
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boundaries (intronic or exonic). The seven genes shown here are known to be 
important in the pathogenesis of specific cancers. a, CDK12. b, PI4KA.c,/RF4.d, 
AICDA. e, Cllorf73 (also known as HIKESHI). f, BCL2. g, SGK1. 


Extended Data Fig. 7| Manhattan plots of seven somatic eGenes associated 
with genic lead burden. Altogether, 11 genic somatic eQTLs showed significant 
changes in gene expression associated with somatic burdens within the gene 
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Extended Data Fig. 8 | Scatter plots of eight somatic eGenes. Plots show the effect of the lead weighted burden on the gene expression residuals (obtained as 
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described in the Methods) of these genes. a, CDK12. b, PI4KA.c, IRF4. d, AICDA. e, Cllorf73.f, BCL2. g, SGK1.h, TEKTS. 
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Extended Data Fig. 9 | Roadmap epigenome marks overlapping flanking 
intervals with somatic burden. a, Maximum fold enrichment of epigenetic 
marks fromthe Roadmap Epigenomics Project across 127 celllines. The 
number of cell lines with significant enrichments is indicated in parentheses 
(FDR <10%); asterisks denote significant enrichments in at least one cellline. 
b, Mean percentages (over the 127 cell lines) of regions overlapping (by at least 
10% of their length) Roadmap epigenome marks, calculated using all genomic 
flanking regions (n = 1,637,638) and the subset of 556 flanking intervals 
associated with somatic eQTL (FDR<5%).c, Mutation rate per kilobase. 

d, Burden frequency (across the 127 cell lines) of the 556 flanking intervals in 
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somatic eQTLs (FDR<5%), overlapping 25 Roadmap epigenome marks. DNase, 
DNase only; EnhA, active enhancer; EnhAc, enhancer acetylation only; EnhAF, 
active enhancer flank; EnhW, weak enhancer; Het, heterochromatin; PromBiv, 
bivalent promoters; PromD, promoter downstream; PromP, poised promoters; 
PromU, promoter upstream; Quies, quiescent/low; ReprPC, repressed 
PolyComb; TssA, active TSS; TxReg, transcription regulatory; ZNF/Rpts, ZNF 
genes and repeats; Tx, transcription; Tx3, transcription 3’, Tx5, transcription 5’; 
TxEnh3, transcription 3’ enhancer; TxEnhsS, transcription 5’ enhancer; TxEnhwW, 
transcription weak enhancer; TxWk, weak transcription. 
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with carcinoma and European patients). e, Volcano plot of directionality of 
effects inthe analysis of all patients. f, g, Comparison of analyses between all 
patients and patients with carcinoma (f) and between all patients and European 
patients (g). The -log,)(P values) per signature—-gene pair are correlated 

(r= 0.763 (f) and r= 0.789 (g), Pearson correlation coefficient), especially above 
an FDR threshold of 10%. 


Extended Data Fig. 10 | Quality control of the association studies between 
gene expression and mutational signatures. a-c, Q-Q plots of the Pvalues of 
the linear model to associate expression of 18,831 genes with 28 mutational 
signatures across all 1,159 patients (a), 877 patients with carcinoma (b), or 891 
European patients (c).d, Number of significant associations (log,o- 
transformed) at different FDR thresholds (across all patients, patients 
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Extended Data Fig. 11| See next page for caption. 


Extended Data Fig. 11| Relationship between mutational signatures and 
gene expression patterns. a, b, Principal component analysis (PCA) of 
signatures across 1,159 patients (PCA on signature-specific SNVs per patient) 
(a) and signature-gene expression associations across 18,831 genes (PCA on 
adjusted P values of signature-gene expression associations) (b). The PCA on 
the SNVs recapitulates known interdependencies, for example, between 
signatures 7, whereas the PCA on the signature-gene association studies also 
emphasizes functional relatedness, for example, between signatures 2 and 13. 
c, Hierarchical clustering of signatures. The numbers at the nodes indicate the 
number of genes commonly associated with two to four respective signatures. 
The dendrogram shows genes that are associated with more than one signature 
mostly owing to similar SNV patterns of these signatures across patients. 


d, Frequency of number of significantly associated genes per signature 
(FDR<10%). Although many signatures are significantly associated witha few 
genes, 18 signatures are associated with more than 20 genes. Signature 9 is 
associated with more than 350 genes. Vice versa, 1,009 genes are associated 
with only one signature, 129 with two, 32 with three, 5 with four and 1 with five 
signatures. e, f, Mutational signature-gene associations, depicting positive 
associations between the expression of the canonical APOBEC pathway genes 
APOBEC3B (e) and APOBEC3A (f) and signature 2. The associations within the 
three cancer type with the strongest correlation between signature and gene 
expression (hepatocellular carcinoma (Liver-HCC), bone leiomyosarcoma 
(Bone-Leiomyo) and prostate adenocarcinoma (Prost-AdenoCA)) are shown. 
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genes with AEI (red) and SCNAs (blue) over the number of measurable genes for 
each patient across the cohort. Cancer types with high chromosomal 


instability also exhibit highest amounts of AEI. 


Extended Data Fig. 12| ASE analysis. a, All types of cancer are ordered by the 


average AEI frequency. The numbers of genes per patient for which ASE could 


be quantified are shown, stratified according to cancer type, resulting in 


between 588 and 7,728 genes per patient. b, Distribution of the fraction of 
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Extended Data Fig. 15 | See next page for caption. 
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Extended Data Fig. 15 | Overview of estimations of promoter activity and 
non-coding promoter mutations associations and patterns. a, b, The 
technical variation of the promoter activity estimates across varying library 
depth (a) and positional bias (b).c, The number of outlier promoters per 
tumour type according to promoter activity variance (variance larger than1.5 x 
the interquartile range). d, Distribution of promoter mutations around 
promoters across the PCAWG cohort for major, minor and inactive promoters. 
Red lines indicate the window 200-bp upstream of a TSS, in which major 
promoters show an enrichment of mutations whereas minor and inactive 
promoters donot.e, Distribution of promoter mutations around promoters 
for the top two most mutated types of cancer (skin melanoma and colorectal 
adenocarcinoma (ColoRect-AdenoCA)). Colorectal adenocarcinoma displays 
avery different mutational pattern from other types of cancer. f, Distribution 
of promoter mutations around major, minor and inactive promoters across 
several types of cancer. Red lines indicate the window 200-bp upstream of a 


TSS, in which major promoters show an enrichment of mutations whereas 
minor and inactive promoters do not. g, Schematic of the calculation of non- 
coding promoter mutational burden. h, Overview of non-coding promoter 
mutations per sample and the number of mutated promoters per tumour type 
for promoters with at least three mutated samples. i,j, Association of absolute 
(i) and relative (j) promoter activity with promoter mutations across all 
samples.k, I, Overview of promoter mutations for skin melanoma tumours. 

k, Most promoter mutations are C>T, which indicates UV-induced DNA 
damage. I, Distribution of promoter mutations for each mutation class reveals 
the enrichment of C>T mutations around the 200-bp window upstream. 

m,n, Overview of promoter mutations for colorectal adenocarcinoma 
tumours. m, Most promoter mutations are C>A and C>T.n, Distribution of 
promoter mutations for each mutation class does not display an enrichment of 
mutations around the 200-bp window upstream, differing from the mutation 
pattern of skin melanomatumours. 
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mutations observed at each position. The first promoter shows a highly 
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Extended Data Fig. 17 | See next page for caption. 


Extended Data Fig. 17 | Alternative splicing and association with somatic 
mutations. a, Number of exon-skipping events confirmed at different APSI 
thresholds in tumour (red), matched healthy (green) and GTEx (blue) samples 
for liver tissue. Dashed lines show the subset of exon-skipping events that only 
contain annotated introns. b, Number of exon-skipping events confirmed at a 
APSl level of greater than 0.3 for the individual histotypes. Transparent section 
of bars represents the fraction of novel events, containing at least one 
unannotated intron. c, Splicing landscape for exon-skipping events. t-SNE 
analysis based on exon-skipping PSI values for all |CGC tumour and healthy 
samples together with tissue-matched GTEx samples. d, Position-specific 
effect of somatic mutations on alternative splicing. Magnitude and direction of 
mutation-associated splicing alterations. e, Permutation-based FDR values for 
SAV detection based on the different types of cancer. f, Cancer gene set 
enrichment for SAV sets, shown for cancer census gene set (middle) and sets 
determined in ref. ** (left) and ref. 7° (right). g, Positional distributions 
(logarithms of distance from the nearest exons) of somatic variant creating 
novel splicing donors and acceptors. h, Sequence motif logos around somatic 
mutation creating novel splicing motifs. i, Example splicing effect of a branch- 
point mutation. UCSC genome browser RNA-seq coverage plots of cassette 
exon event in RBM28 between mutant and wild type. Mutant (bottom track) 


contains an A>G mutation 29 nucleotides upstream from the acceptor site of an 
affected exon. j, Distribution of new cassette exon events detected only within 
the PCAWG cohort. Top, number of events per histology type. Middle, events 
normalized to the total number of cassette exons detected in the histology 
types. Bottom, the number of exonization events per histotype for the subset 
with the novel cassette exons colocated toa somatic alteration near the 
acceptor or donor of the exon. k, Example of an exonization event in the 
tumour-suppressor gene STK11. RNA-seq read coverage for a part of the geneis 
showninred fora donor carrying the alternate allele and in grey forarandom 
donor with reference allele. The cassette exon event is shown asa schematic 
below, with blue (red) boxes denoting constitutive (alternative) exons and blue 
solid lines denoting introns. Magnified panels at the bottom show details from 
Integrative Genomics Viewer visualization, highlighting a somatic mutation at 
the 3’ end of the cassette exon. The associated sequencing changeis illustrated 
onthe bottom right corner, in which the vertical bar denotes the exon-intron 
boundary. I, Alu-based exonization mechanism. Top, the presence of an Alu 
element inan intron in antisense alone will still result in normal splicing. 
Bottom, specific mutations of the Alu sequence creates new splice sites and 
results in exonization. 
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Extended Data Fig. 18 | Recurrent and promiscuous RNA fusions. a, Features two genes: an edge is coloured blue if the fusion has evidence for matched 
of the 27 most recurrent in-frame or open-reading-frame-retaining fusions. structural rearrangements and is coloured red otherwise. Nodes and 
Kinase column indicates whether one of the gene partnersis a kinase gene b, connections are shown only between promiscuous genes. The colour intensity 
Network with connected clusters of at least 10 genes. Genes are represented as indicates whether a gene is involved more often ina fusion as a3’ (purple) or 5’ 
nodes, and the size of anodeis proportional to the number of gene-fusion (green) gene or both (white). 
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Extended Data Fig. 19 | Structural rearrangements associated with RNA 
fusions. a, Systematic classification scheme of all gene fusions based on 
underlying structural variants (SVs). Numbers of fusion events of different 
classes are shown to the right. b, Schematic of examples of different types of 
structural-variant-supported fusions: (1) direct fusions; (2) intercomposite 
fusions; and (3) intracomposite fusions. Bridged fusions are shown in Fig. 3b. 
Only one of the possible orders of genomic arrangement is depicted in each 
case, with break points highlighted by thunderbolts.c, Supported 
rearrangements for composite fusions bring the fused segments of two genes 
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Fusion transcript & Bicanpeitt 
significantly closer. Natural distance indicates the native distance betweentwo 
related structural variant break points. Effective distance indicates the 
distance between the final two break points of the intra- and intercomposite 
fusions. d, The break points of structural-variant-independent fusions are 
typically closer than those for other interchromosomal fusions, which 
indicates that at least some of the structural-variant-independent fusions may 
occur directly at the RNA level, mediated either by trans-splicing or read- 
through events. 
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editing, ASE, fusions and splicing. Each point is asample coloured by histotype, 
and its position is the log-transformed number of aberrations found ineach 
sample. The Benjamini-Hochberg-adjusted Pvalues are calculated froma 
likelihood ratio test assuming negative binomial distribution; histotype is used 
as aconfounder. 


Extended Data Fig. 20 | Correlation of the number of somatic genomic 
alterations with RNA alterations. Scatter plots of log,.-transformed 
frequency of DNAalterations versus log,)-transformed frequency of RNA 
alterations, in which each rowis a DNA alteration in the following order: 
structural variants, copy-number aberrations and non-synonymous variants. 
Each rowis an RNA alteration in the following order: expression outliers, RNA 
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Extended Data Fig. 21| Global view of DNA and RNA alterations affecting 
cancer pathways. Composite pie charts showing the percentages of RNA 
alterations, DNA alterations or both, affecting sets of genes in well- 


characterized cancer pathways and known to be functionally altered in cancer. 
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The sizes of circles represent the percentages of patients affected based onthe 
given gene set. The columns indicate different types of cancer. The numbers in 
parenthesis indicate the number of genes analysed for the specific pathway. 
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Extended Data Fig. 22| Breakdown of DNA and RNAalterations of cancer 
genes. a, Composite pie charts showing percentages of DNAand RNA 
alterations for top cancer-driver genes. The 20 most significant cancer-driver 
genes identified by the PCAWG group in pan-cancer level are depicted, with the 
sizes of the pie charts indicating the percentages of patients carrying 
alterations in the given driver gene. The areas represent the relative 
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percentages of patients exhibiting different alterations depicted by 
corresponding colours. When several types of alteration in one pathway affect 
the same patient, only a fraction is counted towards each type of alteration. 

b, Proportional bar plots showing the distribution of gene alterations for genes 
in the 7P53 and TGFB pathways. 
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Extended Data Fig. 23 | Trans-associations found by co-occurrence analyses. 
a, Scatter plot for association of gene expression outliers with cancer gene 
variants. Each dot represents an alteration pair. The x axis shows all COSMIC 
genes ordered alphabetically and the y axis represents the FDR-adjusted 
Pvalues (q values) based on Fisher’s exact tests. COSMIC genes with more than 
five significant associations (FDR < 5%) are coloured in red and labelled. b, Heat 
map showing the extent of associations between COSMIC gene somatic 
mutations and expression outliers of all genes. Each row indicates one gene, 
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and the colour intensity shows the significance of trans-association. COSMIC 
genes labelled to the right are ordered by the number of significant 
associations. Only the top 10 genes are shown. cc, Enrichment map showing the 
significant (FDR < 0.01) pathways based on the top 100 significant genes 
associated with B2Malterations. Colour intensity represents enrichment 
significance, node sizes the number of analysed genes belonging to the given 
pathway and edge sizes the degree of overlap between two gene sets. Only the 
top 10 enriched terms are shown. 
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Extended Data Fig. 25| Proportion of genes with DNA or RNAalterations. 

a, Full list of 731 genes that are both frequently and heterogeneously altered 
across both RNA- and DNA-level alterations. Yellow bars to the left indicate the 
proportion of samples that had DNA-level alterations, whereas green bars to 
the right indicate the proportion of samples with RNA-level alterations. Middle 
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columnisa heat map corresponding to the -log,)(P value). Asterisks indicatea 
COSMIC Cancer Gene Census (CGC) gene or PCAWG driver genes. 

b, Distribution of alteration types among all significant genes or just CGC or 
PCAWG driver genes. 
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Extended Data Table 1| RNA alteration data 


RNA data Total number of Mean number of gene 
gene alterations alterations per donor 
found 


Gene expression (PCAWG) 93,481 78.69 
Alternative promoters 246 224 207.26 


Alternative splicing 345,115 290.50 
Allele-specific expression 544,664 458.47 


RNA editing (with a non-synonymous 14486 12.19 
change) 

Combined gene-centric table (DNA and RNA | 1,523,098 1,282.07 
alterations) 
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Statistical parameters 


When statistical analyses are reported, confirm that the following items are present in the relevant location (e.g. figure legend, table legend, main 
text, or Methods section). 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


An indication of whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistics including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) AND 
variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Clearly defined error bars 
State explicitly what error bars represent (e.g. SD, SE, Cl) 


Our web collection on statistics for biologists may be useful. 
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Policy information about availability of computer code 


Data collection Core data was collected through Pan-cancer Analysis of Whole Genomes Data Coordination center through https://dcc.icgc.org/releases/ 
PCAWG/ 
Data analysis Core RNA-Seq alignment pipelines are available through Github/Docker: https://github.com/akahles/icgc_rnaseq_align, https:// 


hub.docker.com/r/nunofonseca/irap_pcawg/. STAR, TopHat2, HTSeq, Kallisto, Limix, PLINK, Bedtools, Vcftools, Bcftools, Samtools, Tabix, 
GATK, ASEReadCounter, Lavaan, Mediate, SplAdder, SAVNet, Sv2gf 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers 
upon request. We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


Somatic and germline variant calls, mutational signatures, subclonal reconstructions, transcript abundance, splice calls and other core data generated by the ICGC/ 
TCGA Pan-cancer Analysis of Whole Genomes Consortium is described here58 and available for download at https://dcc.icgc.org/releases/PCAWG. Additional 
information on accessing the data, including raw read files, can be found at https://docs.icgc.org/pcawg/data/. In accordance with the data access policies of the 
ICGC and TCGA projects, most molecular, clinical and specimen data are in an open tier which does not require access approval. To access potentially identification 
information, such as germline alleles and underlying sequencing data, researchers will need to apply to the TCGA Data Access Committee (DAC) via dbGaP (https:// 
dbgap.ncbi.nlm.nih.gov/aa/wga.cgi?page=login) for access to the TCGA portion of the dataset, and to the ICGC Data Access Compliance Office (DACO; http:// 
icgc.org/daco) for the ICGC portion. In addition, to access somatic single nucleotide variants derived from TCGA donors, researchers will also need to obtain dbGaP 
authorisation. Data derived specifically from RNA-Seq analysis can be found at https://dcc.icgc.org/releases/PCAWG/transcriptome. Subfolders contain 
identification and quantification of alternative promoter usage, alternative splicing, RNA fusions, gene expression, transcript-level expression, and RNA editing. 
Identified eQTLs are in https://dcc.icgc.org/releases/PCAWG/transcriptome/eQTL and a binarized table indicating all RNA and DNA alterations for each gene can be 
found in the subfolder https://dcc.icgc.org/releases/PCAWG/transcriptome/recurrence_analyses/. Additionally, QC metrics and metadata are also included. 
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Study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size This study represents the analysis of 1,188 donors from the Pan-Cancer Analysis of Whole Genomes project. The sample size was limited to 
the availability of RNA-Seq data from matched donors with WGS data from the PCAWG project. 


Data exclusions Alarger set of 2,217 RNA-Seq libraries were initially collected and data were excluded after QC analysis. The QC criteria was standard and pre- 
established before excluding data. 


Replication Reproducibility of the analysis is ensured through data-sharing and code-sharing. Unfortunately, at the time of the analysis there were no 
approrpriate datasets to use for replication studies of associations, since this is one of the largest collections of pan-cancer whole genomes 
and matched transcriptomes. For the somatic eQTL analysis, there were some related studies that came to similar conclusions and are noted 
in the Supplementary Information. 


Randomization | Cancer histotypes were defined by the PCAWG Pathology and Clinical Correlates Working Group based on tumor histology. These tumor 
subtypes were accounted for as covariates in all applicable analyses of association. 


Blinding Blinding was not relevant to our study as it was essential to understand underlying confounding variables in our associations, such as tumor 
subtype, sex, etc. 
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Organisms have evolved diverse behavioural strategies that enhance the likelihood of 


encountering and assessing mates’. Many species use pheromones to communicate 
information about the location, sexual and social status of potential partners’. In 
mice, the major urinary protein darcin—which is present in the urine of males— 
provides acomponent of ascent mark that elicits approach by females and drives 
learning®*. Here we show that darcin elicits acomplex and variable behavioural 
repertoire that consists of attraction, ultrasonic vocalization and urinary 

scent marking, and also serves as a reinforcer in learning paradigms. We identify a 
genetically determined circuit—extending from the accessory olfactory bulb to the 
posterior medial amygdala—that is necessary for all behavioural responses to darcin. 
Moreover, optical activation of darcin-responsive neurons in the medial amygdala 
induces both the innate and the conditioned behaviours elicited by the pheromone. 
These neurons define a topographically segregated population that expresses 
neuronal nitric oxide synthase. We suggest that this darcin-activated neural circuit 
integrates pheromonal information with internal state to elicit both variable innate 
behaviours and reinforced behaviours that may promote mate encounters and mate 


selection. 


Communication through scents elicits innate and learned behavioural 
repertoires that enhance the reproduction and survival of the spe- 
cies’. Male mice deposit scent marks that attract females and enable 
assessment of the quality and compatibility of potential mates”. Innate 
attraction in females is elicited by the non-volatile protein pheromone 
darcin (MUP20)**, a member of the major urinary protein (MUP) family 
that is recognized by receptors in the vomeronasal organ®. Darcin not 
only elicits innate attraction but can also serve as an unconditioned 
stimulus for both place and odour conditioning, enabling a female to 
recognize, assess and locate males on the basis of their scent marks? >. 

We developed a quantitative behavioural paradigm to examine 
the effects of darcin, and found that the pheromone elicits a com- 
plex and variable behavioural array. Female mice were placed ina 
chamber equipped with two ports that contained glass fibre filters 
embedded with different social olfactory cues, and entry to the ports 
was quantified. The frequency of port entry provides a measure of 
preference for the cues present on the individual filters. During the 
initial habituation each port contained a blank filter, and port entries 
(pokes) were infrequent (mean + s.e.m. poke count: left port 18 +3, 
right port 14 + 3; Fig. 1b1). The mice were then exposed in their home 
cage? to bedding that had been soiled by male mice, after which the 
number of pokes increased substantially without any apparent side 
bias (left port 247 + 35, right port 246 + 3; Fig. 1b2). The response to 
darcin was therefore examined in cycling female mice after exposure 


to male-soiled bedding’. Poke frequency was higher for the port that 
contained the recombinant darcin (darcin-containing port 516 + 47, 
blank 326 + 21; Fig. 1b3). Exposure to urine that contained very low levels 
of darcin (low-darcin urine, from male BALB/c mice)‘ also elicited more 
frequent port entries than did blank filters in this assay, both with and 
without the addition of recombinant darcin (low-darcin urine 386 +42, 
blank 154 +14, Fig. 1c3; recombinant darcin-added urine 391+ 29, blank 
96 +18, Fig. 1d3). 

Innately attractive cues can often serve as a teaching signal, reinforc- 
ing both classical and instrumental learning’. We examined whether 
exposure to darcin alone or to low-darcin urine elicits a lasting prefer- 
ence for the darcin port after the stimulus is removed. Female mice were 
exposed to a social cue in one port and then placed into aclean chamber 
onthe following day with blank filters in both ports. Poke counts were 
significantly greater in the port that had previously contained either 
darcin (285 + 38, blank 146 + 16; Fig. 1b4) or urine with equivalent levels 
of darcin (179 +15, blank 65 + 9; Fig. 1d4). By contrast, exposure to urine 
with very low levels of darcin did not result in a port preference during 
recall sessions on the following day (prior exposure to low-darcin urine 
147 +14, blank 147 + 16; Fig. 1c4). Therefore, both low-darcin male urine 
and darcin elicit a port preference, but only exposure to normal levels 
of darcin results in a remembered preference. 

We also observed that female mice that were exposed to darcin emit- 
ted ultrasonic vocalizations and engaged in urinary scent marking 
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Fig. 1|Darcin elicits an array of behaviours. a, Timeline of the two-port 
preference assay. b-d, Cumulative poke counts in ports containing darcin 

(1p! pg”) (b), urine from BALB/c male mice with very low darcin levels (less than 
0.1 p11 pg™)* (c) and BALB/c male urine with added recombinant darcin (1p! pg) 
(d) (red), compared with ports containing control filters (blue) during cue- 
exposure sessions. Counts are shown on days 2, 13, 15 and 16 (graphs 1-4, 
respectively), as indicated by arrows onthe timeline. Mean (bold lines, n=30 
mice) and individual (fine lines) counts are shown. The time-stamps for 
ultrasonic vocalizations and scent marking are indicated as arrowheads 

(b (3, 4)). Bias in counts was assessed using the two-sided Wilcoxon signed-rank 
test (b (3, 4), **P=0.004, n=10;c¢ (3), *P=0.006, n=10; d (3, 4), **P=0.004, 
n=10).e, Spectrogram of an example song detected during darcin-exposure 
sessions. f, g, Mean call count (horizontal line) and total number of calls made 
by individual mice (diamonds); n=10 mice (f) as tested in b, n= 43 mice 

(g) as tested across the study. Calls were compared using the two-sided 
Wilcoxon signed-rank test (f, *adjusted-P = 0.03; g, ***adjusted-P= 0.00003 and 
P=0.0001).h, Latency to urinary marking and vocalization in response to 
darcin (n=24 mice) and during recall (n=14 mice) sessions. Mean (squares) and 
individual (circles) latencies are shown. The bounds in box plots are defined by 
the 25th and 75th percentile of the distribution. The line represents the median 
and the upper and lower whiskers represent 75th percentile + 1.5 x interquartile 
range (IQR) and 25th percentile — 1.5 x IQR, respectively. Latencies are 
compared using the two-sided Wilcoxon signed-rank test (*P= 0.03). 


(Fig. le-h). Scent marks were located closer to the darcin port (Extended 
Data Fig. 1a, b) and were smaller in size (Extended Data Fig. 1c) than 
those observed from free urination, which was consistent with the 
deliberate deposition of scent close to darcin; this suggests a distinction 
between the two behaviours. Ultrasonic vocalizations were consistently 
linked with urinary marking and occurred within 40 ms of one another 
(mean + s.e.m. 42 + 9 ms; Supplementary Video 1). These episodes 
did not occur immediately upon exposure to darcin, but appeared 
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Fig. 2| Optogenetic silencing of the AOB results in the suppression of darcin- 
evoked behaviours. a, eNDHR-eYFP expression in the AOB. Scale bar, 200 pm. 
P, posterior; V, ventral; this experiment was independently repeated with18 
mice. b, Timeline of the two-port preference assay. c-f, Cumulative poke 
counts with (c, e) and without (d, f) optical silencing of the AOB. Inc, d mice 
were exposed to darcin (3) (1 pl pg’) (n =10) and ine, f mice were exposed to 
C57BL/6 male urine (3) with normal levels of darcin (1 pg p)* (n= 8) in one port 
(red) anda blank filter (blue) in the second port. Mean (bold lines) and 
individual (fine lines) counts are shown. The time-stamps for ultrasonic 
vocalizations and scent marking are indicated as arrowheads (d (3, 4)). Counts 
were compared using the two-sided Wilcoxon signed-rank test (d (3), 

**P< 0.001; e (3), f (3, 4), *P< 0.008). g, Mean call count (horizontal lines) and 
total number of calls made by individual mice (diamonds) during darcin 
exposure with (c, e) and without (d, f) AOB silencing; n=10 mice. Calls were 
compared using the two-sided Wilcoxon signed-rank test. 


after along and variable delay during a100-min session (mean latency 
53 +5 min, Fig. 1h). Vocalization and urinary scent marking were also 
observed during recall sessions (Fig. 1f-h). These episodes occurred 
earlier inthe recall session than in the darcin-exposure session (recall 
sessions 16 + 4, darcin sessions 53 + 5 min; Fig. 1h). Male urine that 
contained normal levels of darcin also stimulated scent-marking behav- 
iour during cue and recall sessions, but low-darcin urine stimulated 
marking only when present and not during recall sessions (Extended 
Data Fig. 2d). Thus, darcin induces a behavioural repertoire that com- 
prises attraction and ultrasonic vocalization simultaneous with urine 
marking, behaviours that may serve as reciprocal communication. 
Moreover, this behavioural repertoire is also observed during recall 
sessions in the absence of darcin. 

We next implemented genetic strategies to identify the neural 
circuitry that mediates these darcin-induced behaviours. Darcin binds 
to V2R receptors on sensory neurons inthe vomeronasal organ®. These 
neurons extend axons through the skull, where they converge to form 
microglomeruli within the accessory olfactory bulb (AOB)*. Microglo- 
meruli are innervated by mitral cells that project to multiple brain 
regions—including the cortical amygdala, bed nucleus of the stria 
terminalis, and the medial amygdala (MeA)®”. We demonstrated that 
this pathway is responsible for the behavioural repertoire elicited by 
darcin by silencing the AOB. Bilateral injection of an adeno-associated 
virus (AAV) encoding halorhodopsin” fused to enhanced yellow fluores- 
cent protein (eNpHR-eYFP) resulted in the expression of eNDHR-eYFP 
(Fig. 2a) in the majority of mitral cells inthe AOB (73 + 8% across mice). 
AOB silencing eliminated the preference for the darcin-containing 
port (180 + 49, blank 149 + 37; Fig. 2c3) and suppressed darcin-evoked 
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Fig. 3 | Activation of darcin-responsive neurons inthe MeA recapitulates 
pheromone-induced behaviours. a, Genetic strategy used to express ChR2in 
pheromone-responsive neurons. b, Timeline of experimental manipulations. 
c-e, Left, representative images showing eYFP expression in the posterior MeA 
after exposure to darcin (mean +s.e.m.: eYFP counts, 255 +29 in the MeApd and 
115 +16 inthe the MeApv) (c), saline (16 + 5 in the MeApd and 23 + 7in the MeApv) 
(d) and MUP11 (54 +10 in the MeApd and 42 + 9 inthe MeApv) (e). Scale bars, 
400 pm; L, lateral; V, ventral. Right, corresponding cumulative poke counts. 
Mean (bold lines, n= 13 mice for each group, n=39 mice in total) and individual 
(fine lines) counts are shown. The time-stamps for ultrasonic vocalizations and 
scent marking are indicated as arrowheads (c (2, 3), e (2)). Counts were 
compared using the two-sided Wilcoxon signed-rank test (c (2, 3), 

***P= 0.0002). f, g, Mean (horizontal lines) and total calls made by individual 
mice (diamonds) during the light-stimulation sessions (f) and subsequent 

recall sessions (g); n =13 mice per group. Calls were compared using the 
two-sided Mann-Whitney test adjusted for multiple comparisons (f, *P< 0.05; 
g, *P=0.02). 


ultrasonic vocalizations and scent marking (Fig. 2g). By contrast, the 
preference for male urine with normal levels of darcin was not sup- 
pressed during AOB silencing (423 + 40, blank 243 + 25; Fig. 2e3). AOB 
silencing did not affect port investigation during the initial habituation 
periods with the blank filters (Fig. 2c1, 2-2f1, 2). 

Exposure to recombinant darcin elicited a memory for the darcin 
port (prior exposure to darcin 190 + 16, blank 56 + 6; Fig. 2d4), but a 
port preference was not observed if the AOB was silenced during dar- 
cin exposure (prior exposure to darcin 55 + 15, blank 60 +18; Fig. 2c4). 
Females that were exposed to male urine containing normal levels of 
darcinalso exhibited a persistent port preference during AOB silencing 
(urine 423 + 40, blank 243 + 25; Fig. 2e3), but failed to showa preference 
for this port in the recall sessions (prior exposure to urine containing 
normal levels of darcin 97 + 16, blank 94 + 15; Fig. 2e4). These observa- 
tions show that the AOB is necessary for darcin-induced attraction 
behaviours as well as for conditioning. Other components of male 
urine also elicit attraction that is independent of the AOB but fail to 
reinforce conditioned behaviours. 


The mitral cells—the projection neurons of the AOB—send axons to 
the MeA®’. We identified the neurons of the MeA that are responsive 
to darcin by using the promoter of the activity-dependent gene Arcto 
express channelrhodopsin-2 (ChR2), a light-gated ion channel”. AAV 
encoding Cre-dependent channelrhodopsin fused to the fluorescent 
protein eYFP was injected into the MeA of transgenic mice (Arc-CreER 
mice; Fig. 3a) in which the Arc promoter drives the expression of the 
tamoxifen-sensitive Cre recombinase (Cre-ER)”. The administration of 
tamoxifen followed by exposure to darcin should result in the expres- 
sion of ChR2-eYFP in neurons that are activated by darcin. We com- 
pared the expression of Fos with that of ChR2-eYFP and found that 
ChR2-eYFP is faithfully expressed in neurons that respond to darcin 
(78 + 4% of the ChR2-eYFP* neurons also express endogenous Fos, 
and 79 + 3% of the neurons that express endogenous Fos also express 
ChR2-eYFP; n=6 mice). 

We next determined whether the activation of neurons that express 
ChR2 after exposure to darcin is sufficient to recapitulate the behav- 
iours elicited by darcin. Arc-CreER mice injected with AAV encoding 
Cre-dependent ChR2-eYFP in the posterior dorsal medial amygdala 
(MeApd) and the posterior ventral medial amygdala (MeApv) (Fig. 3a, b) 
were treated with tamoxifen and then exposed to darcin, saline ora 
control MUP (MUP11)**. Histological analysis of ChR2-eYFP expression 
induced by exposure to darcin revealed a dense clustering of ChR2- 
eYFP neurons that were restricted largely to the MeApd and the MeApv 
(Fig. 3c). Exposure to MUP11** revealed sparser labelling in both the 
MeApd and the MeApv, and even sparser labelling was observed after 
exposure to saline (Fig. 3d, e). Mice that expressed ChR2-eYFP induced 
by exposure to darcin, MUP11 or saline were introduced into the behav- 
ioural chamber after two days of habituation. We then photoactivated 
the MeA when the mice entered one of the two ports containing blank 
filters, to recapitulate exposure to darcin. Mice that expressed ChR2- 
eYFP induced by darcin exposure exhibited a strong preference for 
the stimulation port (mean poke counts: light 202 + 21, no light 40 +7; 
Fig. 3c2). Photoactivation of the ensemble of darcin-responsive neu- 
rons also elicited ultrasonic vocalizations and scent marking (Fig. 3f, 
Extended Data Fig. 2a—c). Photoactivation of the MeA in mice that 
expressed ChR2-eYFP after exposure to saline (light 26 + 3, no light 
24 + 2; Fig. 3d2) or MUP11 (light 19 + 4, no light 20 + 5; Fig. 3e2) did not 
elicit any preferences for the stimulation port, and did not result in 
ultrasonic vocalizations or urinary scent marking (Fig. 3f). 

Mice that expressed ChR2-eYFP in neurons that were responsive to 
darcin exhibited a remembered preference for the port in which they 
previously received light stimulation (prior photoactivation 126 + 8, 
no activation 39 + 6; Fig. 3c3). Control mice that expressed ChR2-eYFP 
in neurons after exposure to MUPI1I (prior photoactivation 20 + 3, no 
activation 16 + 2; Fig. 3e3) or to saline (prior photoactivation 20 + 2, 
no activation 23 + 3; Fig. 3d3) exhibited no preference for the previous 
light-stimulated port. In recall experiments, ultrasonic vocalizations 
and scent marking were detected only in mice that previously experi- 
enced photostimulation of neurons expressing ChR2-eYFP induced by 
exposure to darcin (Fig. 3g, Extended Data Fig. 2a—c, Supplementary 
Video 2), and notin mice expressing ChR2-eYFP in neurons activated by 
exposure to MUPI1 or saline (Fig. 3g). We demonstrated that exposure 
to darcin could also result in conditioned place preference (Extended 
Data Fig. 3a, b). Therefore, photoactivation of a population of neurons 
that express ChR2 induced by darcin exposure can elicit innate attrac- 
tion, ultrasonic vocalizations, urinary scent marking and reinforce 
conditioned behaviours. 

Lactating females fail to exhibit attraction to darcin”. We therefore 
asked whether darcin activates MeA neurons in lactating females. Lac- 
tating Arc-CreER mice expressing Cre-dependent eYFP in the MeA 
were exposed to darcin three to five days postpartum. Exposure to 
darcin in virgin females resulted in dense labelling of posterior MeA 
neurons with eYFP expression (mean +s.e.m. eYFP” cells: 255 +29 in the 
MeApd and 115 +16 inthe MeApv). Exposure to darcin during lactation 
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Fig. 4| nNOS neurons inthe MeA are necessary for darcin-mediated 
behaviours. a, Representative image showing the co-expression of eYFP 
expressed in darcin-responsive neurons (Fig. 3c, left) and nNOS in the posterior 
MeA (D, dorsal; M, medial; n=7 mice). Scale bars, 400 pm (main image, left); 
100 pm (smaller images, right). b, eYFP expression in coronal sections of the 
posterior MeA of annNOS-ires-Cre mouse (scale bar, 400 pm; CoApm, 
posterior medial cortical amygdala; this experiment was independently 
repeated with 66 mice).c, Timeline of the two-port preference assay. 

d-f, Cumulative poke counts in mice expressing eNpHR (e, f) or eYFP (d) in 
nNOS neurons. Neurons expressing eYFP (n=12 mice) (d) and eNpHR (n=11 
mice) (e) were photostimulated, with no photostimulation of neurons 
expressing eNpHR (n=11 mice) (f). Mean (bold lines) and individual (fine lines) 
counts are shown. The time-stamps for ultrasonic vocalizations and scent 
marking are indicated as arrowheads (d (3, 4), f (3, 4)). Counts were compared 
using the two-sided Wilcoxon signed-rank test (d (3, 4), ***P< 0.0005; f (3, 4), 
**P<0.005).g, Vocalization counts of mice expressing eYFP (n =12 mice) and 
eNpHR (n=11 mice). Mean (horizontal lines) and total calls made by individual 
mice (diamonds). Calls were compared using the two-sided Wilcoxon signed- 
rank test, adjusted for multiple comparisons (*P< 0.05). 


resulted in a sparse labelling (23 + 11in the MeApd and 15+12inthe 
MeApvy) at levels similar to that observed upon saline exposure (16 +5 
in the MeApd and 23 + 7 in the MeApv) (Extended Data Fig. 4f-h). By 
contrast, darcin activates an equivalent number of mitral cells in the 
AOB of both virgin and lactating females (Extended Data Fig. 4a—e, Fos 
cells in virgin females 378 + 35 and in lactating females 358 + 45; n=6, 
P=0.9). Therefore, the darcin-activated circuit is likely to be gated by 
lactation in the MeA. 

We next identified a genetic marker, neuronal nitric oxide synthase 
(nNOS), which defines the population of MeA neurons that mediate 
the darcin-induced behaviours. Immunohistochemical examination 
of the MeA of Arc-CreER mice revealed that a considerable fraction of 
neurons that express ChR2-eYFP in response to darcin also express 
nNOS. We found that 18% of neurons in the posterior MeA express nNOS. 
Double-labelling experiments demonstrated that this nNOS-expressing 
population (denoted nNOS neurons) consists of 55 + 4% excitatory 
neurons (vGlut2’* cells) and 24 + 3% inhibitory neurons (Gad2’ cells). 
We observed that 74 + 2% of the ChR2-eYFP-expressing neurons that 
are labelled upon darcin exposure also express nNOS, whereas 66 +3% 
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of the nNOS neurons also express ChR2-eYFP (Fig. 4a). Similar values 
are obtained in Arc-CreER mice that are exposed to male urine contain- 
ing normal levels of darcin. The pheromones ESP1™, MUP11>* and cat 
salivary lipocalin Fel-D4*, as well as female urine, activated less than 
20% of the nNOS neurons (Extended Data Fig. 5, Extended Data Table 1). 
The majority of the MeA neurons activated by these stimuli do not 
express nNOS, which demonstrates the specificity of the response of 
nNOS neurons for darcin. 

These observations suggest that activation of the nNOS neurons in 
the MeA should elicit the behavioural repertoire that is observed upon 
darcin exposure. We therefore injected AAV encoding Cre-dependent 
ChR2-eYFP into the posterior MeA of mice in which the Nos1 promotor 
drives the expression of Cre (nNOS-ires-Cre) to express channelrhodop- 
sin-2 in nNOS neurons. We then photoactivated nNOS* MeA neurons 
whenthe mouse entered one of the two ports containing blank filters, 
and observed a strong preference for the stimulation port (light 541+45, 
no light 66 +12; Extended Data Fig. 3d2). Moreover, photostimulation 
of nNOS cells expressing ChR2-eYFP evoked ultrasonic vocalization 
and scent marking (Extended Data Figs. 2a—c, 3f). Photoactivation of 
these MeA neuronsalso reinforced conditioned behaviours (prior light 
295 +16, no light 57 + 11; Extended Data Fig. 3d3). Control experiments 
in which AAV encoding Cre-dependent eYFP was injected into the MeA 
of nNOS-ires-Cre mice failed to elicit any darcin-mediated behaviours 
upon photostimulation (light 24 + 4, no light 25 + 5, P= 0.8, Extended 
Data Fig. 3c2; prior light 23 + 7, no light 25 + 6, P=0.8, Extended Data 
Fig. 3c3). Therefore, photoactivation of ChR2-eYFP in nNOS neuronsin 
the MeA is sufficient to recapitulate both the innate and the reinforcing 
behaviours that are observed upon exposure to darcin. 

These observations predict that silencing of the nNOS neurons inthe 
MeA should impair the behavioural response to darcin. The bilateral 
injection of an AAV (AAVDJ-EFla-DIO.eNpHR3.0-eYFP, Fig. 4b) encoding 
the Cre-dependent opsin into the nNOS-ires-Cre mice resulted in the 
expression of halorhodopsin” in nNOS neurons inthe MeA. In mice in 
which the nNOS neurons were silenced, no preference was observed 
for filters that contained recombinant darcin in the poke-preference 
assay (darcin 35 +5, blank 39 + 5; Fig. 4e3), and darcin elicited no port 
preference during recall sessions (prior exposure to darcin 35 +5, blank 
42 + 6; Fig. 4e4). Ultrasonic vocalizations and urinary scent marking 
were also eliminated upon light-induced silencing of nNOS neurons 
(Fig. 4g). As a control we showed that, when photostimulation was 
terminated, darcin elicited a strong port preference that was also 
observed during recall sessions (prior exposure to darcin 375 + 40, 
blank 186 + 28; Fig. 4f4). Light-induced silencing in the MeA of mice 
that expressed eYFP innNOS neurons failed to inhibit darcin-mediated 
behaviours (Fig. 4d). Notably, silencing of the MeA also inhibited the 
port preference that was elicited by urine containing normal levels of 
darcin (Extended Data Fig. 6b3). These observations suggest that the 
components in urine other than darcin that elicit port preference also 
require the MeA. We found that silencing of nNOS neurons resulted in 
inhibition of poking to control filters after females were exposed to 
male scent in their home cages (blank 24 +5, blank 23 +5; Fig. 4e2). We 
performed additional experiments to demonstrate that the inhibition 
of darcin-evoked behaviours upon the silencing of nNOS neurons was 
not due to diminished motivation (Extended Data Fig. 7). 

We thenasked whether the nNOS neurons inthe MeA are also required 
for the expression of the remembered response. Female mice were 
exposed to darcin, and then nNOS neurons were silenced only during 
recall sessions. These mice exhibited a strong preference for the port 
that had previously contained darcin (prior exposure to darcin 254 +22, 
blank 77 +17; Extended Data Fig. 6e4). Darcin-responsive neurons that 
express nNOS in the MeA are therefore necessary to recapitulate the 
innate and reinforcement behaviours elicited by darcin. Recall of darcin 
memory, however, no longer requires this neural population. 

The array of properties elicited by darcin suggests that this phero- 
mone does notelicit a simple behavioural response, but rather activates 


acomplex integrative process that may optimize mate encounters and 
mate selection. First, the attractive response is rapid and prolonged 
upon darcin exposure, whereas vocalization and scent marking are 
variable and often occur after long delays. Activation of the nNOS popu- 
lation of neurons by darcin may therefore elicit a state of ‘sexual drive’, 
which increases the probability of individual component behaviours 
that are suited to enhance the likelihood of mate encounters under 
different environmental circumstances. Darcin exposure results in 
exploration and assessment of the darcin source: the urine of a domi- 
nant male. In the absence of the male, after active search strategies 
have failed, the female may emit ultrasonic vocalizations synchronized 
with scent marking in an attempt to communicate her presence and 
her current oestrus status to the male. 

Second, darcin activation of the nNOS neurons reinforces bothcon- 
textual and olfactory learning—generic learning processes—which may 
allow the female to return to the location of the male’s scent mark‘ or to 
track airborne scents of the territorial male*>. The MeA may therefore 
provide a signal mediated by darcin to midbrain dopamine neurons to 
reinforce more traditional ‘non-social’ reinforcement learning’. The 
more stereotypical communication behaviours elicited by darcin— 
vocalization and scent marking—might also result from reinforcement 
of aspecific set of social behaviours that coordinate a successful mate 
search. Whereas the nNOS neurons are required for the behavioural and 
reinforcing effects of darcin, recall of darcin-elicited memory no longer 
requires this neural population—presumably reflecting the transfer of 
a learned representation in other brain structures. 

Third, we observe that male urine with very low levels of darcin elicits 
attraction but does not result in reinforcement learning or memory 
of port preference. This attractive response does not require the AOB 
but is eliminated upon silencing the nNOS neurons of the MeA. These 
observations suggest that the MeA is integrating pheromonal informa- 
tion from the vomeronasal pathway with olfactory cues from the main 
olfactory system to elicit both innate attraction and learning. 

Finally, the response to darcin is dependent on internal state. Lactat- 
ing females fail to exhibit this complex behavioural response to darcin 
exposure”. We found that darcin activates the projection neurons in 
the AOB in lactating females, but fails to activate the nNOS neurons in 
the MeA (Extended Data Fig. 4). Taken together, these observations 
suggest that the nNOS neurons of the MeA integrate internal state with 


the pheromonal cues to mediate both innate variable behaviours and 
reinforced behaviours that may coordinate a successful mate search. 
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Methods 


Mice 

All surgical and experimental procedures were performed in compli- 
ance with the Guide for the Care and Use of Laboratory Animals” from 
the National Institute of Health Standards, and approved by the Cold 
Spring Harbor Laboratory and Columbia University Medical Center 
Institutional Animal Care and Use Committees. Experiments were con- 
ducted with 279 female mice between 6 and 30 weeks old. Mice were 
purchased at 4 weeks old and were handled for at least 10 min each 
day for aminimum of 5 days before experimentation. Surgeries were 
performed on mice that were 6 weeks old in order to match their brain 
coordinates to the Allen Reference Atlas. The mouse lines used were as 
follows: Arc-CreER (a gift from C. Denny at Columbia University; also 
available from Jackson Laboratory, Jax stock 022357); ICR outbred 
(CD-1) wild-type mice (Harlan/Envigo); Ail4 (Rosa-CAG-LSL-tdTomato); 
nNOS-ires-Cre (Jax stock 017526); vGlut-ires-Cre (Jax stock 028863); 
Gad2-T2a-NLS-mCherry (Jax stock 023140). The nNOS-ires-Cre mice 
were crossed to ICR outbred mice (Harlan/Envigo) for 15 generations 
to exchange their genetic background to the ICR mice. Throughout the 
study, five mice were co-housed ina single cage for two to six months. 
This long-term co-housing has the potential to suppress oestrus cycling 
in females (the Lee—-Boot effect)”. To ensure that all females had pre- 
viously encountered male scent and were showing normal oestrus 
cycling, females were exposed to male-soiled bedding from an unfamil- 
iar strain for at least 60 h’. They were then visually evaluated for their 
stage of oestrus before the experimental testing. One hour before test- 
ing, each mouse had its vaginal opening photographed for evaluation. 
After oestrus entrainment, most females (more than 90%) were evalu- 
ated to be in the pro-oestrus stage’ of the cycle (with swollen, moist, 
pink and wide-open vaginal openings’’) and advanced into behavioural 
testing. Mice were kept in a controlled 12-h day/night (7:00 to 19:00) 
cycle and tested only during the night phase (23:00 to 6:00). 


Behavioural assays 

Before behavioural training, mice were handled for 10 min each day 
for five days, and were given access to a mouse exercise cage that was 
enriched with spinning discs and toys for one hour every day during 
the experimental period. Training took place in a custom-designed 
sound-isolation chamber containing a behavioural arena (25 x 25 x 28 
cm) integrated with two stimulus ports (circular nose port (4.6-cm 
diameter) with an attachable circular cup for the filter (1.3-cm diam- 
eter)), which were surrounded by distinct visual stimuli (stripe and 
circle stickers were used on either side (Context Kit for Conditioned 
Place Preference, Stoelting)) onthe walls. Mice were tested under room 
light during the night phase of their day/night cycle (23:00 to 6:00). 
Mice poked their snouts into stimulus ports to sample the social stimuli. 
The social cue was presented ona glass microfibre filter in a portable 
cup attached to the nose port. Social cue ports were constructed out of 
metal and boiled in detergent (1-2% Alconox for at least 15 min), rinsed 
thoroughly with water, dipped in 3% hydrogen peroxide and ethanol, 
rinsed again with running distilled water and air-dried to clean off any 
contaminants between experiments. The frequency and duration of 
nose pokes were quantified by means of an infrared beam within the 
port. The behavioural nose poke data were acquired through a MATLAB 
interface and a Bpod. 

Ultrasonic vocalizations in the chamber were captured using an 
Avisoft ultrasound microphone witha frequency range of 20-200 kHz. 
The microphone was connected to a portable time-code generator and 
reader (Horita PTG2), which generated a time code that was embed- 
dable into both the audio and the video files. Avisoft Recorder USGH 
software was used to record vocalizations and integrate time codes 
from the PTG2. To capture urinary scent-marking behaviours with 
the embedded time code, a Marshall Genlock 3G-SDI HDMI camera 
was mounted at the base of the transparent chamber. An AJA K; Pro 


Recorder, which was connected to the camera and the PTG2, was used 
to record video for the entire duration of the session. The time code 
generated by the PTG2 was visible as a display within the video window 
of the Marshall camera recording through the AJA recorder, and was 
also recorded by Adobe Captivate. 

The nature of the ultrasonic vocalizations in each session was ana- 
lysed with Avisoft SAS Lab Pro (Supplementary Videos 1, 2). We quan- 
tified call counts as the number of syllables in a given session of an 
individual mouse. Comparison of the calls emitted in response to the 
pheromone and the calls emitted upon the photoactivation of MeA 
neurons confirmed that the pheromone- and photoactivation-evoked 
syllables shared similar sonic qualities (Extended Data Fig. 8, Extended 
Data Table 2). Allspectrograms were additionally parametrized using 
SAP 2011” and MUPET” software, and all syllables emitted by the mice 
during the sessions were manually extracted and classified for analysis 
(Extended Data Fig. 8). To analyse the urinary scent-marking behaviour 
of the mice, Adobe Premiere Pro was used. To determine the concur- 
rency between urination and vocalization, Adobe Premiere Pro was 
used to align the video to the audio by using the time shown by the 
OLED display of the PTG2 (visible in the video window) in conjunction 
with the time code encoded in the audio file as temporal references. In 
addition, engagement of the poke port resulted in the simultaneous 
activation of a red LED, which was visible to the human eye in the video 
window but not to the mice, and a TTL (transistor-—transistor logic) 
pulse, which was recorded in the ultrasonic audio track as a labelled 
time event by the Avisoft Recorder USGH software. Engagement of 
the port was thus used as an additional online reference to observe 
the alignment of audiovisual events, and this was recorded by Adobe 
Captivate. In addition, the distances from urinary drops to the base 
of each of the ports were quantified for the pheromone, photoactiva- 
tion and free-urination sessions. Distances were extrapolated from 
individual frames of the video using Adobe Photoshop. 

Mice were placed in the behaviour chamber for 100 min once per 
day for each session during the dark phase (23:00 to 6:00) of their day/ 
night cycle (7:00 to 19:00). The behavioural chamber and the stimulus 
ports were thoroughly cleaned with 1-2% Alconox detergent, distilled 
water, 3% hydrogen peroxide and 80% ethanol, rinsed again with dis- 
tilled water and air-dried in between individual sessions. The first ten 
sessions served as habituation sessions, during which no social cue was 
present in either social cue port. Therefore, there were no special cues 
available to the mice as they acclimatized to movement in the chamber 
and, for subjects involved in optogenetic experiments, movement while 
tethered to the patch cord. For behavioural testing, all mice—except 
for the optically activated mice—were exposed for 60 hin their home 
cage’ to bedding soiled by male mice, followed by an extra habituation 
session with blank filters in both stimulus ports after this home-cage 
treatment. Subsequently all mice were tested with social cues or opti- 
cal activation present in either port. The social cue or activation sides 
were randomly assigned between two ports across mice to control for 
any potential side bias. For the optical activation experiments, anose 
poke into the stimulation port triggered an external laser pulse (473- 
nm light, 60 pulses, 20 Hz) using a PulsePal! device. 

The ICR background mice that were not tested optogenetically were 
subjected to the following social cues in one port: recombinant darcin 
(1 pg pl); male urine with low levels of darcin (<0.1 pg pl in BALB/c) 
Ola-Hd urine, purchased from Harlan/Envigo)*; male urine with normal 
adult levels of darcin (1 pg pl, C57BL/6) Ola-Hsd urine, purchased from 
Harlan/Envigo)*; or recombinant darcin added to BALB/cJ Ola-Hd male 
urine with low levels of darcin (BALB/cJ plus recombinant darcin, lpg 
pl). In all instances, there was no odour in the other port. To confirm 
the presence or absence of darcin (18,893 Da MUP20), 12% SDS-PAGE 
gel electrophoresis of all urine samples was performed*”. 

The C57BL/6 Arc-CreER and nNOS-ires-Cre/ICR mice tested with opti- 
cal activation were subjected to optical activation in one port and no 
optical activation in the other port. The ICR outbred mice tested with 


AOB inactivation were subjected to either recombinant darcin (11 pgin 
10 pl) or male urine with normal levels of darcin (10 pl of CS57BL/6 Ola-Hd 
urine)‘ in one port and no odour in the other port. The nNOS-ires-Cre 
mice tested with MeA inactivation were subjected to darcin or male 
urine with normal levels of darcin (C57BL/6 Ola-Hd)* in one port and 
no odour inthe other port. All the optical-silencing experiments used a 
continuous light-on protocol during the entire test sessions. The final 
session for all mice was a recall session designed to quantify retained 
poke preferences in the absence of social cues or optical activation. 

For conditioned place preference experiments, C57BL/6 Arc-CreER 
and nNOS-ires-Cre/ICR mice were introduced into atwo-chamber condi- 
tioned place preference arena (22 x 16 x 28cm, length x width x height) 
for 100 min once per day for each session. The two chambers had dis- 
tinct walls decorated with visual cues (stripes and circles stickers, Con- 
text Kit for Conditioned Place Preference, Stoelting)); chambers were 
separated by acorridor anda divider, each containing a single nose port. 
Light stimulation was delivered to a port in one of the two chambers, 
and there was no optical activation in the other port. A nose poke into 
the light-stimulation port triggered an external laser pulse (473-nm 
light, 60 pulses, 20 Hz) using a PulsePal” device during the light- 
stimulation sessions only. During the habituation and recall sessions, a 
nose poke into the light-stimulation port did not trigger a laser pulse. 
Videos were recorded throughout the 100-min sessions. The positions 
of the mice were tracked using Ethovision, and the occupancy trajec- 
tories and time spent in each chamber were computed for analysis. 

To demonstrate that MeA nNOS neurons are indispensable for social 
cue reinforcement behaviours only, we optogenetically silenced nNOS 
neurons inthe MeA and tested the mice with water as a reinforcer rather 
than darcin. Mice were tested using atwo-port setup without any social 
cues. Before behavioural training, mice were gradually water-restricted 
over the course of a week and kept under water scheduling until the tests 
were concluded. Mice were placed in the behaviour chamber for 100 min 
once per day for each session during the dark phase (23:00 to 6:00) of 
their day/night cycle (7:00 to 19:00). The first ten sessions served as 
habituation sessions, during which no cue was present in either port. 
Mice were acclimatized to the movement in the chamber while being 
tethered to the patch cord. They were then subjected to male-soiled 
bedding exposure for 60 hin their home cage and an extra habitua- 
tion session with blank filters in both ports following this home-cage 
treatment. Subsequently, all mice were tested for cue sessions. The cue 
sides were evenly split ina random manner between two ports across 
the mice to control for any potential side bias. During cue sessions, a 
nose poke in one port rewarded the mice with 5 ul of water, and there 
was no reward for a nose poke in the other port. Behavioural training 
sessions lasted 100 min, during which the mice typically collected at 
least 4 ml of water. The final session for water-reinforcement behaviour 
was arecall session designed to quantify the retained poke preferences 
without any water reward. The behavioural hardware was controlled 
by custom MATLAB programs and a Bpod and PulsePal”. 

Investigators were blinded to the allocation of the mice during the 
experiments and data analysis. 


Stereotactic surgeries 

An adeno-associated virus (AAV) DJ serotype” (1.3 x 10° vg mI™ 
(genomic), 8 x 10°1IU mI (infectious) titre, Stanford Vector Core Facil- 
ity) carrying EFla DIO hChR (E123T/T159C)-p2A-eYFP-WPRE, EFla DIO 
NpHR3.0-eYFP, EFla DIO-eYFP or EFla NDHR3.0-eYFP construct was 
injected in 4- to 6-week-old mice. The mice were anaesthetized with an 
intraperitoneal injection of a ketamine and xylazine mixture (0.13 mg 
per g body weight ketamine and 0.01 mg per gram body weight 
xylazine). Small craniotomies were made above the posterior MeA 
(-2.0 mm AP and 2.3 mm ML from the bregma) or the AOB (3.2mm AP, 1 
mm MLand0.8-1.5mm DV). Virus was injected with a glass micropipette 
using a Picospritzer (General Valve). For posterior MeA injections, 
20-60 pulses of 10-ms duration were delivered at 0.2 Hz starting from 


a depth of 4.6 mm from the brain surface up to 5.2 mm in 200-um steps, 
waiting a minimum of 10 min per site to allow diffusion of the virus. 
After virus injection, fibre-optic cannulas were implanted. The mice 
received asupplementary dose of ketamine at 30- to 90-min intervals 
to maintain the depth of anaesthesia. The cannula was positioned with 
the help of a stereotaxic arm (David Kopf Instruments) and cannula 
holder (Doric Lenses) above the craniotomy. The optical cannula was 
gradually lowered close to the viral injection depth (100-300 um above 
the injection site). Two miniature watch screws (Micro-Mark) were fixed 
into the parietal plates as anchors. The cannula was secured to the skull 
with light-curable dental cement (Vitrebond Plus) followed by a layer of 
black dental acrylic (Lang Dental Manufacturing). For post-operative 
analgesia, ketoprofen (5 mg per kg body weight) was administered 
subcutaneously. The mice were allowed to recover for one week. 


Exposure of Arc-CreER mice to social cues 
One week after stereotaxic viral infection and cannula surgery, 6- to 
8-week-old Arc-CreER mice were transferred to a reverse day/night 
cycle. They were individually housed unless mentioned otherwise and 
oestrus was synchronized through exposure to male-soiled bedding for 
60 h**. Mice were then injected with 2 mg of tamoxifen (Sigma T5648), 
which was prepared as a10 mg mI‘ stock solution dissolved in a mix- 
ture of ethanol and sunflower seed oil (Sigma S5007). Five hours after 
tamoxifen injection, the mice were exposed to darcin, MUPI1, saline, cat 
salivary lipocalin (Fel-D4), ESP1™ (exocrine-gland secreting peptide), 
male urine with normal levels of darcin, female urine or male urine with 
low levels of darcin ona glass microfibre filter (10 mm diameter) placed 
through the roof of their home cage; 10 pl (equivalent to 11 pg of dar- 
cin**, MUP11*“, equivalent to 3.3 pg of Fel-D4" and 25 pg of ESP1") was 
used. The lactating females were separated from their pups five hours 
before tamoxifen injection and exposed to recombinant darcin between 
postpartum days 3 and 5. Recombinant cat Fel-D4 was produced using 
the pMAL Protein Fusion and Purification System (New England Bio- 
labs) and assayed by SDS-PAGE. The mouse ESP1 was synthesized by 
Atlantic Peptides. The mice were monitored with infrared cameras to 
confirm that they had interacted with the filters. Optical activation 
experiments were conducted three weeks after exposure to the cues. 
Three weeks after the tamoxifen injection, the Arc-CreER mice that 
were subjected to optical stimulation were re-exposed to darcin for 
2 hand then euthanized for immunohistochemistry. 


Immunohistochemistry 

Once the behavioural criteria for each behaviour assay were met, the 
mice were anaesthetized with a ketamine and xylazine mixture (0.30 mg 
per g body weight ketamine, 0.03 mg per g body weight xylazine) and 
perfused transcardially with 4% paraformaldehyde (PFA) ina phosphate 
buffer pH 7.4 (PBS). The brain was dissected and incubated at 4 °C in 
4% PFA, washed in 1x PBS, and stored in PBS at 4 °C until sectioning. 
Subsequently, 50-um coronal brain sections were made using a Leica 
VT1000S vibratome. The sections were incubated with a blocking solu- 
tion (5% normal goat serum and 0.1% Triton in PBS (PBST)), washed in 
0.1% PBST (3 washes, 15 min each) and incubated overnight at 4 °C with 
primary antibodies diluted in blocking solution. The following primary 
antibodies were used: anti-GFP (rabbit polyclonal, 1:1,000, Rockland), 
anti-GFP (chicken polyclonal 1:400, Aves Labs), anti-nNOS (rabbit 
polyclonal, 1:400, Invitrogen), anti-mCherry (rat monoclonal, 1:800, 
Thermo Fisher Scientific) and anti-Fos (goat and rabbit polyclonal, 
1:500, Santa Cruz Biotechnology; guinea pig polyclonal, 1:5,000, with 
RRID: AB_2814707, generated by S. Brenner-Morton, at ZMBBI, Colum- 
bia University). The following day the sections were washed in 0.1% PBST 
(3 washes, 15 min each) and incubated for 2h at room temperature with 
secondary antibodies at 1:500 dilutions (Alexa-594 goat anti-rabbit, 
Alexa-633 donkey anti-goat, Alexa-488 goat anti-rabbit, Alexa-488 goat 
anti-chicken, Alexa-594 goat anti-rat, Alexa-488 goat anti-guinea pig, 
Jackson ImmunoResearch, and NeuroTrace Alexa-640/660, Molecular 
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Probes). Sections were washed in 1x PBS for 15 min and mounted using 
Vectashield mounting medium (Vector Laboratories). Confocal images 
were acquired using an LSM780 Zeiss microscope at 10x, 20x and 65x 
magnifications. Area and cell counts were manually quantified using 
Image) (NIH) software by an individual who was blinded to the experi- 
mental conditions. 


Statistical analysis 

Port preferences within each session type (habituation day 2, habitua- 
tion day 13, cue exposure and recall) for each subject were compared by 
matched Wilcoxon signed-rank tests. Port bias for the left port over the 
right port was computed by taking the difference in total poke count 
between the left and the right ports for each mouse. Comparisons 
were across each session (habituation day 2, habituation day 13, cue 
exposure, and recall) using the Wilcoxon signed-rank test. Port bias was 
compared across independent treatment cohorts by Mann-Whitney 
(for pairwise comparisons) and Kruskal-Wallis tests (for three-way 
comparisons). All poke count data did not approximate to normality 
and so non-parametric tests were used. Call counts were compared 
across independent cohorts of mice using a Mann-Whitney test and 
across different sessions of the same cohort of mice using the Wil- 
coxon signed-rank test. Adjusted Pvalues were reported where multiple 
comparisons were made on the same sample set by using the Holm’s 
sequential Bonferroni correction method. The probabilities of urinary 
scent-marking behaviour were compared across sessions using the 
McNemar test. Exact tests were performed for all comparisons, includ- 
ing those in which the sample sizes were small (the discordant pairs 
in some of our comparisons were less than 25). The mean latencies to 
first urinary scent marking were compared using a paired t-test. The 
latency data approximated to normality as confirmed by Shapiro-— 
Wilk, Lilliefors, Kolmogorov-Smirnov, Anderson Darling, D’'Agostino-K 
squared and Chen-Shapiro tests. All analyses were performed using 
R, OriginLab and MATLAB. 


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The datasets generated and/or analysed during the current study are 
available from the corresponding authors on reasonable request. 
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Extended Data Fig. 1| Darcin and photoactivation of posterior MeA neurons 
condition scent-marking place preference. a, Representative frames from 
videos of the pheromone (1) and photoactivation (2) sessions, and free-range 
behaviours (3). b, Distance from urinary drop to each of the poke ports during 
various sessions. Individual frames were analysed using Adobe Photoshop CC 
to quantify the distance from the centre of a urinary drop to the base of each 
poke port. Units are scaled from pixels to centimetres. Distances were 
compared using the two-sided Wilcoxon signed-rank test (***P< 0.0005, 
*P=0.01;n=24 mice, (1);n=12 mice, (2);n=20 mice, (3)).c, Area of urinary 
drops under various conditions. Individual frames were analysed using Adobe 


Photoshop CC to quantify the area of the urinary marks. Units are scaled from 
square pixels to square centimetres. Scent-mark area (mean+s.e.m., 

cm’): darcin, 5+ 0.05,n=24 mice; recall of darcin, 5+ 0.09, n=14 mice; 
photoactivation, 4+ 0.4,n=12 mice; recall of photoactivation, 4+0.5,n=8 
mice; free urination, 13 + 2;n=20 mice. Areas were compared using the two- 
sided Mann-Whitney test (***P< 0.0005), adjusted for multiple comparisons. 
The bounds in the box plots in b, care defined by the 25th and 75th percentile of 
the distribution. The lines represent the median and the upper and lower 
whiskers represent the 75th percentile + 1.5 x IQR and 25th percentile —1.5 x 
IQR, respectively. 
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Extended Data Fig. 2| Darcin and photoactivation of posterior MeA neurons 
reinforce recall of vocalization and scent-marking behaviours. a—c, Data for 
individual mice for all unique sessions across the study were pooled. a, Mean 
(horizontal line; n= 43 mice (darcin group), n=24 mice (photostimulation 
group)) and total calls made by individual mice (diamonds) detected during 
various sessions. Call counts were compared using the two-sided Wilcoxon 
signed-rank test within the respective groups (***P< 0.0005), adjusted for 
multiple comparisons. b, Latency fromthe start of the session to urinary 
marking and vocalization behaviour (mean +s.e.m., seconds) during exposure 
to darcin (3,160 +311, n=24 mice), recall of darcin exposure (956 + 217, n=14 
mice), photostimulation (4,195 + 372, n=12 mice) and subsequent recall 

(1,315 + 418, n=8 mice) sessions. Latencies were compared within groups using 
the matched-pair two-sided f-test (*P=0.005, ***P= 0.00009). The bounds in 
the boxplots are defined by the 25th and 75th percentile of the distribution. 
The line represents the median and the upper and lower whiskers represent 
75th percentile + 1.5 x IQR and 25th percentile — 1.5 x IQR, respectively. 

c, Probability of urinary scent-marking and vocalization behaviours. Mean 
probabilities are given for the darcin session (0.6, n=43 mice), recall of darcin 
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session (0.3,n=43 mice), photostimulation-evoked urinary marking and 
vocalization (0.5,n=24 mice) and recall of photostimulation-evoked 
behaviours (0.3, n= 24). Probabilities were compared using the two-sided 
McNemar test (*P< 0.05). d, Probability and mean latency to first urinary scent 
marking in the different sessions (n=9 mice). Data from 100-min habituation 
sessions (mean +s.e.m., latency for urination, seconds) and after exposure to 
male-soiled bedding in the home cage (1,411+126), low-darcin urine from 
BALB/c mice (1,116 + 232), recall session of BALB/c urine (1,607 + 268), urine 
from C57BL6/6J mice containing normal levels of darcin (2,666 + 337) and recall 
of C57BL6/6J urine (1,032 + 198) are shown. Probabilities were compared using 
the two-sided McNemar test (*P= 0.02), adjusted for multiple comparisons. 
Latencies were compared within groups using the matched-pair two-sided 
t-test and across groups using the unpaired two-sided t-test (**P=0.0008, 
*P=0.02), adjusted for multiple comparisons. Scent-marking behaviours in 
response to low-darcin urine during the subsequent recall sessions were 
compared (habituation to recall, P=1, cue to recall session comparison, P= 0.1, 
two-sided McNemar test). 
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Extended Data Fig. 3| See next page for caption. 
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Extended Data Fig. 3 | Activation of darcin-responsive neurons inthe 
posterior MeA recapitulates darcin-induced behaviours. a, Heat map 
showing occupancy of the chamber during a habituation, photostimulation 
and recall session. b, Occupancy plot showing the percentage of time spent in 
the photostimulation room. Arc-CreER mice were exposed to darcin (magenta), 
saline (green) or MUP11 (blue). The plot shows the mean +s.e.m. (n=5 mice per 
group, total n=15 mice) percentage of time spent in stimulation room during 
habituation, photostimulation and recall sessions. For occupancy time, 
pairwise comparisons were performed using the two-sided Mann-Whitney test 
(*P<0.05) and three-way comparisons were performed using Kruskal-Wallis 
tests (habituation P= 0.6, light stimulation P=0.009 and recall sessions 
P=0.008). c-f, Activation of nNOS neurons inthe posterior MeA recapitulates 
darcin-induced behaviours. c, d, Cumulative poke counts during habituation 
(laser off; 1), light stimulation (laser on; 2), and recall (laser off; 3) sessions in 
mice expressing eYFP (c) or ChR2 (d) innNOS neurons. Light stimulation was 
performed in one port (red) and not inthe second port (blue). During 
habituation (1) and recall (3) sessions, no light stimulation was given, and red 
and blue reflect right and left ports, respectively. Mean (bold lines, n=11 mice 


for each group) and individual (fine lines) cumulative poke counts are shown. 
The time-stamps for ultrasonic vocalization and scent-marking behaviours are 
indicated as arrowheads (d (2, 3)). Poke counts were compared using the two- 
sided Wilcoxon signed-rank test (***P= 0.0001). Control group (eYFP) port 
entries (c) are contrasted to the ChR2 group (d) during light stimulation (red 
entries for ChR2 (d (2)) compared to eYFP (c (2)); P=0.0002) and recall sessions 
(red entries for ChR2 (d (3)) compared to eYFP (c (3)); P= 0.0002, two-sided 
Mann-Whitney test, adjusted for multiple comparisons). e, Occupancy plot 
showing the mean percentage of time spent in the photostimulation room by 
all mice during various sessions. nNOS-ires-Cre mice were injected with AAV 
encoding either eYFP (green) or ChR2-eYFP (purple); plots are colour-coded to 
their respective groups; n=6 mice per group, n=12 mice total. Occupancy 
times were compared using a two-sided Mann-Whitney test (*P< 0.05). f, Mean 
(horizontal lines, n=11 per group, n=22 total) and total calls made by individual 
mice (diamonds) detected during the photostimulation (2) sessions in mice 
expressing eYFP (c (2)) or ChR2 (d (2)) innNOS neurons. Call counts were 
compared using the two-sided Mann-Whitney test (*P=0.007). 
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Extended Data Fig. 4 | Inlactating females, darcin activates mitral cellsin 
the AOB but fails to activate MeA neurons. a-c, Representative images 
showing Fos expression (orange) and NeuroTrace (blue) in sagittal sections of 
the AOB following exposure to saline (a) or darcin in virgin females (b) and 
lactating females (c). Experiment was independently repeated on 6 mice for 
each group. d, Bar plots quantifying Fos-expressing cells inthe AOB. Fos counts 
(mean ¢+s.e.m.): saline 153 + 38, darcin in virgin females 378 + 35, darcinin 
lactating females 358 + 45; n=6 mice per group. Cell counts were compared 
using the two-sided Mann-Whitney test (*P= 0.02), adjusted for multiple 
comparisons. e, Bar plots quantifying the mitral/tufted cells inthe AOB. 
Number of cells (mean +s.e.m.): saline 1,188 +167, darcin in virgin females 
1,129 + 93, darcin in lactating females 1,210 + 163; n=6 mice per group. Cell 
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counts were compared using the two-sided Mann-Whitney test. 

f, g, Representative images showing eYFP expression in coronal sections of the 
posterior MeA of Arc-CreER mice after exposure to darcin in virgin females (f) 
and lactating females (g). Experiment was repeated on 13 mice and 4 miceinf 
and g, respectively. h, Bar plots quantifying eYFP-expressing cells inthe MeApd 
and the MeApyv. Cell counts were compared using the two-sided Mann-Whitney 
test, adjusted for multiple comparisons. *P= 0.008, **P=0.0006, ***P< 0.0005. 
Mean +s.e.m. eYFP-expressing cell counts: saline, 16 +5inthe MeApd and 23+7 
inthe MeApv, n=13 mice; darcin exposure in virgin females, 251+ 29 inthe 
MeApd and 115 +16 in the MeApv, n=13 mice; darcin exposure in lactating 
females, 23 +11in the MeApd and 15+12 inthe MeApv,n=4 mice. 
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Extended Data Fig. 5 | Identification of neurons in the posterior MeA that 
respond to vomeronasal stimuli and their overlap with the genetic marker 
nNOS. a, Representative images showing the stimulus-responsive (eYFP, 
orange) and nNOS-expressing (cyan) neurons inthe posterior MeA of 
Arc-CreER mice exposed to cat salivary lipocalin Fel-D4 (n=5 mice), saline 
(n=8 mice), ESP1(n=5 mice), MUP11(n=5 mice), female urine (n=5 mice), male 
urine with low levels of darcin (n=4 mice), male urine with normal levels of 
darcin (n=9 mice) and darcin (n=7 mice). b, Corresponding box plots 
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quantifying the percentage overlaps between the stimulus-responsive (eYFP) 
and nNOS* neurons in the posterior MeA of mice exposed to the various stimuli. 
Orange plots represent the percentage of YFP cells that overlap with nNOS; 
cyan plots represent the percentage of nNOS cells that overlap with YFP. The 
bounds in box plots are defined by the 25th and 75th percentile of the 
distribution. The lines represent the median and the upper and lower whiskers 


represent the 75th percentile + 1.5 x IQR and 25th percentile -1.5 x IQR, 
respectively. 
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Extended Data Fig. 6|See next page for caption. 
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Extended Data Fig. 6 | The additional effects of silencing nNOS neurons in 
the posterior MeA. a—c, Functional convergence of both olfactory systems 
mediated by the posterior MeA is pivotal for male urine reinforcement. 

a, Timeline of the preference assay. Mice were habituated in the chamber for 
ten days (1), then exposed to male-soiled bedding for 60 hin their home cage 
(2), followed by one additional day of habituation before male urine (with 
normal levels of darcin (1 pg p”)) was presented in one of the two ports (3). 
Urine was removed for the recall session one day later (4). Port preference was 
quantified from port entries. b,c, Cumulative poke counts during habituation 
(1), habituation after treatment (2), exposure to male urine (3) and recall (4) 
sessions for mice expressing eNDHR-eYFP (n=10) with (b) and without (c) 
optical silencing of nNOS neurons. Poke counts were recorded on the days 
indicated by purple arrows ina. Mice were exposed to male urine in one port 
(red) anda blank filter (blue) inthe second port (3). During habituation (1, 2) 
and recall (4) sessions both ports contained a blank filter. Mean (bold lines) and 
individual (fine lines) cumulative poke counts are shown. Poke counts were 
compared using the two-sided Wilcoxon signed-rank test (***P= 0.0002). The 


effect of silencing nNOS neurons is quantified with matched pair differences 
(male urine session comparisons, b (3) toc (3), P=0.002) and recall of male 
urine with darcin (recall session comparisons, b (4) toc (4), P=0.002) using 

the two-sided Wilcoxon signed-rank test, adjusted for multiple comparisons. 
d,e, Optical silencing of nNOS neurons does not affect recall of darcinmemory. 
Cumulative poke counts during habituation (1), habituation after treatment (2), 
darcin (3) and recall (4) sessions in mice expressing eNpHR (n=11) with optical 
silencing during all sessions (d (1—4)) and with optical silencing during recall 
sessions only (e (4)). Poke counts were recorded on the days indicated by purple 
arrows ina. Mice were exposed to darcin in one port (red) anda blank filter 
(blue) in the second port (3). During habituation (1, 2) and recall (4) sessions 
both ports contained a blank filter. Mean (bold lines) and individual (fine lines) 
cumulative poke counts are shown. Poke counts were compared using the two- 
sided Wilcoxon signed-rank test (**P= 0.001). The effect of silencing nNOS 
neurons during recall sessions was tested with matched pair differences (c, cue 
(e (3)) to recall (e (4)) comparisons, laser off (e (3)) and on (e (4)), P= 0.1, using 
the two-sided Wilcoxon signed-rank test, adjusted for multiple comparisons. 
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Extended Data Fig. 7| Mice subjected to optical silencing of nNOS neurons 
retained a motivation to poke. To establish the primacy of the MeA in 
mediating darcin-evoked behaviours rather than altering general motivation, 
mice expressing eNpHR in nNOS neurons were also tested. a, Timeline of the 
two-port preference assay. b-d, Cumulative poke counts during habituation 
(1), habituation after exposure to male-soiled bedding in the home cage (2), 
darcin exposure (3) and recall (4) sessions with (b) and without (c) optical 
silencing of nNOS neurons, and with optical silencing again after 4 weeks (d) 
(n=11 mice). Poke counts were recorded on the days indicated by purple arrows 
ina. Mice were exposed to darcin in one port (red) and a blank filter (blue) inthe 
second port. During habituation (1, 2) and recall (4) sessions both ports 
contained a blank filter. Mean (bold lines) and individual (fine lines) cumulative 
poke counts are shown. Poke counts were compared using the two-sided 
Wilcoxon signed-rank test (**P= 0.001). The effect of silencing nNOS neurons 
after alearning experience is quantified during habituation sessions after 
exposure to soiled bedding in the home cage (port entries to the same port 
(red) with blank filters are compared during habituation after home-cage 


treatment sessions in b (2) and c (2), laser onand off, P=0.002, in b (2) andd (2), 
laser on, P=0.001, and c (2) and d (2), laser offand on, P=0.5). The paired count 
differences (red—blue port) are compared across darcin sessions (b (3) tod (3), 
laser on, P=0.5, andc (3) tod (3), laser off and on, P=0.0001) and recall of 
darcin (recall session comparison b (4) tod (4), P=0.9, andc (4) tod (4), 
P=0.0001) using the two-sided Wilcoxon signed-rank test, adjusted for 
multiple comparisons. e, Optical silencing of nNOS neurons in the MeA does 
not affect non-social reinforcement behaviour. Cumulative poke counts during 
habituation (1), habituation after treatment (2), and water (3) sessions in mice 
expressing eNpHR (n=12) innNOS neurons in the MeA with silencing. Poke 
counts were recorded on the days indicated by purple arrows ina. Water- 
restricted mice were rewarded witha drop of water (5 11) inone port (red) anda 
blank filter in the second port (blue). During habituation (1, 2) sessions both 
ports contained a blank filter. Mean (bold lines) and individual (fine lines) 
cumulative poke counts are shown. Poke counts were compared using the two- 
sided Wilcoxon signed-rank test (**P=0.0005). 
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Extended Data Fig. 8 | Ultrasonic vocalizations that are emitted by mice sizes) for frequencies are given in Extended Data Table 2 for the locations 
exposed to darcin or stimulated optogenetically consist of seven unique indicated by the corresponding letters onthe spectrograms. b, Percentages of 
syllable categories. a, Representative spectrograms of ultrasonic different call categories emitted by mice exposed to darcin (n=24, in green) 
vocalizations classified into seven categories of call. The heat maps showthe and optogenetically stimulated (n=12, inblue). 


intensities of the vocalizations. Descriptive statistics (mean +s.d., sample 


Extended Data Table 1| Cell counts for exposure to different cue types, nNOS expression and the overlaps in the posterior MeA 


Cue Type 


Darcin (n=7) 


Male urine normal darcin 
(n=9) 


Male urine low darcin 
(n=4) 

Female 

Urine 

(n=5) 

MUP11 

(n=5) 


ESP1 
(n=5) 
Saline 
(n=8) 


Fel-D4 
(n=5) 


YFP + 


in 


MeApd 


200 + 18 


262 + 45 


102 + 6* 


70 + 12* 


49 + 10* 


85 + 18* 


YFP + YFP + % 
nNOS+ nNOS+ YFP+ nNOS+ YFP + nNOS+ nNOS/ 
nNOS+ YFP + nNOS + YFP 
in in in 
in in in in in MeA MeA MeA % 
MeApd MeApd MeApv MeApv MeApv pd+pv pd+pv pd+pv Overlap 
220422 146415 155421 179428 120421 311446 348454 232437 66+3 
155423 114416 168443 114425 63413 344433 275445 159421 59+5 
301 + es ri 
152417 36412* 64426 123 21+8 160414 250422 41+17* 22+8 
160422 27+47* 4244 172461 17+5 104415* 297447 452411* 112+3* 
206+53 174 5* 53412 183449 2648 102417* 390493 432 8* 16 + 6* 
323448 144 3* 111430 576475 31+8 197+43 899+93 45411* 5+1* 
264+41 6+43* 144 8* 290+73 544 36415* 554+92 1124 6* 21* 
204428 0+0* 8+8 189+51 0+0* 13413* 314491 0+0* 0+0* 


Counts (mean + s.e.m. cell counts) quantifying region-specific and overlapping expression of cue-responsive (eYFP* neurons) and nNOS-expressing neurons in the MeApd and the MeApv 
for female mice exposed to darcin, male urine with normal levels of darcin, male urine with low levels of darcin, female urine, MUP11, ESP1, cat Fel-D4 or saline. The percentage of overlap 


(mean + s.e.m.) is quantified between total eYFP and nNOS-expressing neurons in the posterior MeA. Comparisons are made pairwise between darcin and all other cue types for eYFP*counts 
and the overlaps using the two-sided Mann-Whitney test (*P < 0.05). Comparisons are made pairwise between darcin and all other cue types for the percentage of nNOS-expressing neurons 
overlapping with eYFP using the two-sided Mann-Whitney test (*P < 0.05). 
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Extended Data Table 2 | Syllable categories for darcin and light-evoked ultrasonic vocalizations 


Darcin-Evoked Light-Evoked P(Tst) 
(mean + sd) (mean + sd) 
Complex n=5 n=9 
A 62kHz + 23kHz 67kHz + 2kHz 
B 68kHz + 22kHz 72kHz + 4kHz 
Duration 42ms + 18ms 111ms + 54ms : 
Harmonics n=8 n=15 
A 83kHz + 15kHz 66kHz + 4kHz * 
B 91kHz + 22kHz 78kHz + 6kHz 
Cc 45kHz + 8kHz 49kHz + 3kHz 
D 46kHz + 11kHz 49kHz + 2kHz 
Duration 77ms + 26ms 81ms + 36ms 
Short n=3 n=9 
A 75kHz + 28kHz 74kHz + 11kHz 
B 78kHz + 26kHz 73kHz + 11kHz 
Duration 8ms + 4ms 8ms + 3ms 
Two-syllable n=45 n=27 
A 75kHz + 8kHz 81kHz + 8kHz * 
B 86kHz + 12kHz 100kHz + 6kHz igs 
Cc 57kHz + 10kHz 67kHz + 6kHz cai 
D 63kHz + 9kHz 72kHz + 8kHz bi 
Duration 56ms + 25ms 74ms + 23ms ba 
Upwards n=17 n=39 
A 64kHz + 12kHz 72kHz + 9kHz bi 
B 79kHz + 8kHz 81kHz + 12kHz 
Duration 19ms + 9ms 36ms + 17ms eee 
Frequency steps n=13 n=8 
A 74kHz + 10kHz 77kHz + 10kHz 
B 85kHz + 9kHz 94kHz + 14kHz 
Cc 52kHz + 11kHz 62kHz + 10kHz ‘ 
D 53kHz + 8kHz 64kHz + 13kHz * 
E 86kHz + 7kHz 100kHz + 15kHz * 
F 76kHz + 10kHz 91kHz + 17kHz 7 
Duration 71ms + 26ms 97ms + 36ms 
Flat n=7 n=34 
A 62kHz + 12kHz 7OkHz + 6kHz 
B 60kHz + 13kHz 71kH + 7kHz 
Duration 47ms + 23ms 74ms + 47ms * 


Different call categories emitted by mice exposed to darcin (n = 24) and optogenetically stimulated (n = 12). Frequencies and durations are compared with the unpaired two-sided t-test 
(*P < 0.05, **P < 0.005, ***P < 0.0006). 


| | at | pki [ S¢ arc | Corresponding author(s): Prof. Adam Kepecs and Prof. Richard Axel 


Last updated by author(s): 12/01/2019 


Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
Lo AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection The behavioral nose poke data were acquired through a MATLAB 2009b-2013a (MathWorks Inc, Natick, MA) interface and an Arduino- 
powered device (Bpod v0.5, Sanworks, LLC, Stony Brook, NY). Avisoft Recorder USGH software was used to record vocalizations and 
integrate time codes from the Horita PTG2. The position of the mice was tracked using video tracking software XT10 (Noldus, Ethovision 
XT 10). A video recorder (AJA Ki Pro Recorder, AJA Video Systems, Grass Valley, CA, United States), which was connected to the camera 
and the Horita PTG2, was used to record video for the entire duration of the session. The time code generated by the Horita PTG2 was 
visible as an OLED (Organic Light Emitting Diode) display within the video window of the Marshall Electronics camera recording through 
the AJA recorder and was also recorded by screen-capturing software (Adobe Captivate CC 2017, Adobe Systems Inc.).The behavioral 
hardware (valves for water delivery and port sensors) and the laser for optogenetic stimulation were controlled by custom MATLAB 
2009b-2013a programs (MathWorks Inc, Natick, MA) and two Arduino-powered devices (Bpod v 0.5, Sanworks LLC and PulsePal, 
Sanworks LLC). 


Data analysis The behavioral nose poke data were analyzed by a custom program written in MATLAB 2009b-2013a (MathWorks Inc, Natick, MA). To 
analyze the nature of the ultrasonic vocalizations, Avisoft SAS Lab Pro Version 5.2.12 was used. A fast Fourier transformation (FFT) was 
applied to the recordings to generate spectrograms; the following parameters were used: FFT length of 256, Hamming window, time 
window overlap of 50%, frequency resolution of 977 Hz, and time resolution of 0.5 ms. The vocalizations were frequency modulated to a 
human-audible range using Avisoft SAS Lab Pro Version 5.2.12. All spectrograms were additionally parametrized using SAP 2011, MUPET 
and MATLAB 2009b-2013a software. To analyze the urinary scent marking behavior of the animals, Adobe Premiere Pro CC 2017 (Adobe 
Systems Inc.) was used to process the video recordings from each session. The position of the mice was tracked using video tracking 
software XT10 (Noldus, Ethovision XT 10), and the occupancy trajectories and time-spent in each chamber were computed for analysis. 
All statistical analyses were performed using R (R: A language and environment for statistical computing, R Foundation for Statistical 
Computing, Vienna, Austria), OriginLab Pro 2017-2019 (OriginLab Corp., Northampton, MA), and MATLAB 2009b-2013a(MathWorks Inc, 
Natick, MA). 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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Policy information about availability of data 
All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- A description of any restrictions on data availability 


The datasets generated and/or analyzed during the current study are available from the corresponding author on reasonable request. 
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Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size No statistical method was used to determine the sample size prior to the study, but sample sizes were consistent with the research papers in 
the relevant field. 


Data exclusions _ In very rare cases, mice that are confirmed to have no viral expression after behavioral testing were excluded, and these exclusion criteria 
were predetermined. 


Replication All attempts to replicate the behavioral, opto-genetics, and immunohistochemical experiments were successful, indicating the robustness of 
the presented results. To guarantee reliable replication of our results, we preferred to work with high infectious titer (3e6 |U/ml) and DJ 


serotype AAV viruses (as purchased from Stanford virus core), especially when we delivered the viruses into medial amygdala. 


Randomization Animals were randomly assigned to the test. The social cue or light stimulation (for optogenetic activation) sides were evenly split in a random 
manner between two ports across the animals to control for any potential side bias. 


Blinding All behavioral experiments were scored by an individual blind to the social cue or light stimulation side and experimental design. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used All antibodies used in this study are commercially available and validated by the manufacturers, except the guiena pig c-fos 
antibody . The following primary antibodies were used: anti-GFP (rabbit polyclonal, 1:1000, Rockland, catalog #: 600-401-215, 
ot#s: 28983, 33267or chicken polyclonal 1:400, Aves Lab, catalog#:GFP-1020, lot#s: GFP879484, GFP697986), anti-nNOS (rabbit 
polyclonal, 1:400, Invitrogen, catalog#:61-7000, lot#s: 1207899A, 987786A, 1578834A, 797629A), and anti-c-fos (goat and rabbit 
polyclonal, 1:500, Santa Cruz Biotechnology, catalog #: rabbit sc 52, lot#: B1115, goat sc 52-G, lot#s: 10215, J1613, and K1109, 
and guinea pig polyclonal c-fos, with RRID#, AB_2814707, generated by Susan Brenner-Morton at ZMBBI, Columbia 
University). Secondary antibodies are all used at 1:500 dilutions (alexa-594 goat anti-rabbit: Jackson ImmunoResearch, catalog#: 
111-585-003, lot#: 135 626, 140268, alexa-594 goat anti-rat: BioLlegend, Clone: Poly4054, catalog#: 405422, lot#: B262774, 
alexa-633 donkey anti-goat: Life Technologies, catalog#: A21082, lot#1711470, alexa-488 goat anti-rabbit: Jackson 
mmunoResearch, catalog#: 111-545-006, lot#131752, alexa-488 goat anti-chicken: Jackson ImmunoResearch, catalog#: 


103-545-155, lot#: 139170, alexa-488 donkey anti-rabbit: InVitrogen catalog#: A21206, lot#: 1981155, alexa-488 goat anti- 
guinea pig: Jackson ImmunoResearch, catalog#:706-545-148, lot#s: 127887, 143798, alexa-488 donkey anti-chicken: Jackson 
ImmunoResearch, catalog#:703-545-155, lot#s: 126602, alexa-594 donkey anti-rabbit: InVitrogen, catalog#: A21207, lot#: 
1987293, and NeuroTrace alexa 640/660, Molecular Probes, catalog#: N21483, lot# 1656094). 


Validation All antibodies are validated to react with corresponding mouse antigens. 


Rabbit anti-nNOS: validated by Invitrogen and Thermofisher (https://www.thermofisher.com/us/en/home/life-science/ 
antibodies/invitrogen-antibody-validation.html?icid=ab-search-learning-ab-validation), tested with the following applications, 
ELISA, Western Blotting, Immunohistochemistry (frozen), see the reference: Huang, PL. et. al. (1995) Nature 377: 239-242. 
CiteAb database reports 53 citations for this antibody. 


Rabbit anti-GFP: validated by Rockland Immunochemicals (https://rockland-inc.com/store/Antibodies-to-GFP-and-Antibodies- 
to-RFP-600-401-215-O4L_18562.aspx), tested with ELISA, Western Blotting, immunohistochemistry, IF microscopy applications, 
and reported that no reactivity was observed against Human, Mouse or Rat serum proteins. CiteAb database reports 36 citations 
for this antibody. 


Chicken anti-GFP: validated by Aves (https://www.aveslabs.com/products/green-fluorescent-protein-gfp-antibody), tested 
with Western blot, and immnunohistochemistry. See the reference: Lu J. et. al. 2017(10):1377-1383 for both GFP antibodies. 
CiteAb database reports 702 citations for this antibody. 
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c-fos antibodies are validated by Santa Cruz Biotech. using immunohistochemical, Western Blotting and immunofluorescence. 
See the references: Choi GB. et. al. Cell 146(6):1004-15 and Root CM. et. al. (2014) Nature 515 (7526): 269-73. These c-fos 
antibodies are discontinued by the manufacturer, therefore we generated a guinea pig polyclonal c-fos antibody with the help of 
Susan Brenner-Morton at ZMBBI, Columbia University. We validated guinea pig c-fos by comparisons to commercially available c- 
fos antibodies from Santa Cruz. We performed immuno-histochemistry on free floating mouse brain sections with various c-fos 
antibodies and counted the number of cells stained by each antibody for comparisons in various mouse brain regions. 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Experiments were conducted with 279 female mice between 6 and 30 weeks old. Mice were purchased at 4 weeks old. The 
mouse lines used were Arc-CreER (a gift from Christine Denny at Columbia University; also available from Jackson Laboratory, Jax 
stock #022357), ICR outbred (CD-1) wild-type mice (Harlan/Envigo), Ai14 (Rosa-CAG-LSL-tdTomato), nNOS-ires-CRE (Jax stock 
#017526), vGlut-ires-CRE (Jax stock #028863), Gad2-T2a-NLS-mCherry (Jax stock #023140). The nNOS-ires-CRE mice were 
crossed to ICR outbred mice (Harlan/Envigo) for 15 generations to exchange their genetic background to the ICR mice. 


Wild animals This study did not involve any field captured animals. 
Field-collected samples This study did not involve any field captured samples. 
Ethics oversight All surgical and experimental procedures were done in accordance with the National Institute of Health’s Guide for the Care and 


Use of Laboratory Animals and approved by the Cold Spring Harbor Laboratory and Columbia University Medical Center 
Institutional Animal Care and Use Committees. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Cortical organoids are self-organizing three-dimensional cultures that model features 


of the developing human cerebral cortex’”. However, the fidelity of organoid models 
remains unclear* °. Here we analyse the transcriptomes of individual primary human 
cortical cells from different developmental periods and cortical areas. We find that 
cortical development is characterized by progenitor maturation trajectories, the 
emergence of diverse cell subtypes and areal specification of newborn neurons. By 
contrast, organoids contain broad cell classes, but do not recapitulate distinct cellular 
subtype identities and appropriate progenitor maturation. Although the molecular 
signatures of cortical areas emerge in organoid neurons, they are not spatially 
segregated. Organoids also ectopically activate cellular stress pathways, which 
impairs cell-type specification. However, organoid stress and subtype defects are 
alleviated by transplantation into the mouse cortex. Together, these datasets and 
analytical tools provide a framework for evaluating and improving the accuracy of 
cortical organoids as models of human brain development. 


Organoid models harness the natural self-assembly properties of 
development to produce 3D cultures from stem cells that recapitulate 
aspects of an endogenous organ’s structure and function. Organoids 
have applications in disease modelling, drug screening, and regen- 
erative medicine. Single-cell RNA sequencing (ScRNA-seq) provides a 
powerful method for comparing the fidelity of organoid cell types to 
their primary cell counterparts across tissues. In the liver and kidney, 
benchmarking comparisons with normally developing organs indicate 
that 3D culture better recapitulates primary cell types than adherent 
culture®’. However, the lack of acomprehensive catalogue of cell types 
in the normal developing human brain and their molecular features 
has prevented careful evaluation of the strengths and weaknesses of 
cerebral organoids. 

In vitro models of human cortical development are particularly valu- 
able because early events during neurogenesis and synaptogenesis 
may underlie neuropsychiatric disorders, and experimental access to 
the developing human cortex is otherwise limited. Initial studies have 
indicated that broad classes of cells are preserved in cortical organoid 
models? but also hint at distinctions between organoids and primary 
cells**”°. In particular, the extent to which spatial and temporal gradi- 
ents of gene expression and cell-type maturation are recapitulated in 
organoids is unclear (Extended Data Fig. 1). Although analysis of some 
of the first organoid models suggested the emergence of spatial gradi- 
ents!” we know little about the fidelity and organization of areal cell 
types in organoids, in part because we lack molecular cell signatures 
of cortical areas in the developing brain. 


Comparison of human cortex and organoids 


To evaluate the fidelity of cortical cell types in organoids, we performed 
high-throughput scRNA-seq of samples from the developing human 
cortex and cortical organoids, and compared the results with published 
organoid single-cell sequencing datasets. To characterize molecu- 
lar features and gene-expression signatures during human cortical 
development, we performed scRNA-seq of dissociated cells from five 
cortical samples collected at 6-22 gestational weeks (GW), encompass- 
ing the period of neurogenesis. To assess cell-type differences across 
cortical areas, we studied primary samples from seven regions, includ- 
ing prefrontal (PFC), motor, parietal, somatosensory and primary visual 
(V1) cortices as well as hippocampus, resulting in transcriptomic data 
from 189,409 cells (Methods, Fig. 1a, Supplementary Table 1). These 
primary data were compared to data from 235,121 single cells from 
37 organoids (Fig. 1b). We generated forebrain organoids by following 
three previously published protocols using different levels of directed 
differentiation to evaluate whether increased stringency in pattern- 
ing signals results in more endogenous-like cellular subtypes!*+*” 
(Extended Data Fig. 2). To assess biological replicability, we used three 
induced pluripotent stem cell (PSC) lines and one embryonic stem cell 
line. Organoids were maintained under the same conditions, except 
for protocol-specific medium formulations (Extended Data Fig. 2), and 
samples were collected for immunohistochemistry and scRNA-seq 
after 3, 5, 8,10, 15 and 24 weeks of differentiation to evaluate relevant 
cell types (Extended Data Figs. 3, 4). Last, we compared our reference 
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Fig. 1|Celltypesin cortical primary and organoid samples. a, Single-cell 
sequencing of primary cortical cells identifies anumber of cell types. These 
cell types are labelled in the ¢-distributed stochastic neighbour-embedding 
(t-SNE) plot on the left, and markers of cell-type identity depict progenitors 
(SOX2), oRGs (HOPX), IPCs (EOMES), newborn neurons (NEUROD6), maturing 
neurons (SATB2) and inhibitory interneurons (DLX6-AS1). Single-cell datacan 
be explored at https://organoidreportcard.cells.ucsc.edu. b, Single-cell 
sequencing of cortical organoid cells generated from four different 


dataset to published organoid single-cell data generated from 276,054 
cells across eight protocols, including time points from six months toa 
year? °8°3"5, This enabled us to extend our comparisons to later stages 
of differentiation (Extended Data Figs. 5, 6). 


Impaired cell-type fidelity in organoids 


We identified broad cell types that corresponded to radial glia, inter- 
mediate progenitor cells (IPCs), maturing neurons and interneurons 
in both datasets (Fig. 1, Supplementary Tables 2, 3).In primary cortical 
samples, we also found clusters of microglia, oligodendrocyte pre- 
cursors, mural cells and endothelial cells. Additionally, we identified 
previously described subtypes of radial glia, as well as a few instances of 
area- and layer-specific subtypes of excitatory neurons. Inthe primary 
samples, there was extensive intermixing within clusters of ages and 
cortical areas (Extended Data Fig. 4a). Within our organoids, cell lines, 
protocols and ages also intermixed, with variation primarily resulting 
from differences between cell types. Across lines and protocols, the 
forebrain marker FOXGI1 was broadly expressed (Extended Data Figs. 5b, 
7a), and the cell-type composition was similar across organoids of the 
same ages, even between lines and protocols, validating differentiation 
towards forebrain identity. Organoids had 45% fewer cells expressing 
HOPX, a marker of outer radial glia (oRG), than primary samples, and 
63% fewer EOMES* IPCs, as previously noted*”’. We also found a 94% 
reduction in the number of SATB2* upper layer neurons in organoids 
compared to primary samples (Fig. lb, Extended Data Fig. 3). 

To quantitatively compare cell types in primary and organoid sam- 
ples, we performed correlation analysis of marker genes (see Meth- 
ods) based upon our clusters in each dataset. We categorized each 


pluripotent stem cell lines and three protocols with varied levels of directed 
differentiation generates similar cell types to primary cortex, but the 
population proportions differ. The proportion of cells for each marker in each 
sample type are: SOX2’ (primary 15.4%, organoid 41.2%), HOPX® (primary 7.6%, 
organoid 4.2%), EOMES’ (primary 4.1%, organoid 1.5%), NEUROD6’ (primary 
51.9%, organoid 20.3%), SATB2* (primary 32.5%, organoid 2.0%), and DLX6-ASI* 
(primary 17.1%, organoid 3.5%). 


cluster in terms of its class (neuronal or non-neuronal), cell-cycle state 
(dividing or postmitotic), type (radial glia, excitatory neuron and so 
on) and subtype (for example, oRG, layer IV excitatory neuron), and 
quantified the correlation between organoid and primary cell catego- 
ries. Neural class and proliferative state were largely preserved, as has 
been previously reported?**3, However, cell types (P=1.02 x 10°) and 
subtypes (P= 5.34 x 10°’) were significantly less well correlated to all 
organoid-derived cells, regardless of protocol (Fig. 2a). Our correlative 
analysis across all published organoid datasets suggested that anumber 
of radial glia or neuronal clusters corresponded equally well to multiple 
primary cell subtypes and thus were designated as ‘pan-radial glia’ or 
‘pan-neuronal’. Lack of subtype resolution resulted ina smaller number 
of high-quality subtypes in organoids compared to primary samples 
(see Methods; Fig. 2a). We validated our observation of limited subtype 
specificity between datasets using five additional batch correction 
methods and observed little overlap between organoid and primary 
clusters (Extended Data Fig. 8, Supplementary Table 5). 


Organoid radial glia lack specificity 

The differentiation program that generates neurons from radial glia 
is highly conserved” *°, and we sought to identify genes that strongly 
discriminate progenitors from neurons. We were surprised to find 
that primary cell types are defined by more than twice as many genes 
as organoid cells, and that type-defining genes largely did not over- 
lap between datasets (Fig. 2b, Extended Data Fig. 9, Supplementary 
Table 6). We used a gene score metric that quantifies the degree of 
enrichment and specificity for each marker gene ina dataset (Methods), 
which is initially low in primary cells but increases substantially over 
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Fig. 2|Molecular comparisons of cell subtypes between primary and 
organoid samples. a, Each cluster was classified by marker genes for class, 
state, type and subtype (primary: n=5 individuals across independent 
experiments; organoids: n=37 organoids from 4 PSC lines across 4 
independent experiments). Heat map shows correlation between pairwise 
combinations of marker genes (red intensity: Pearson’s correlation from-1to 
1). First histogram indicates cell subtypes in primary (orange) and organoid 
(blue) samples. Second histogram shows quantitative correlation fromthe 
best match for each category averaged across clusters (mean +s.d.; subtype 
versus type, **P= 0.0073; subtype versus state, ****P= 0.00008; subtype 
versus class, ****P= 0.003; Welch’s two-sided t-test). b, Marker specificity of 
primary and organoid cluster markers. Using VariancePartition, the genes 
defining metadata properties were evaluated for contribution to overall 
variance. Genes contributing >25% variance by cell type were used in the Venn 
diagram. Box and whisker (mean +s.d.) depicts level of specificity at class 
(****P=4.4 x10), state (****P=4.7 x10“), type (****P=1.02 x 10°) and subtype 


development (Extended Data Fig. 7e). Inall cases, organoids exhibited a 
significantly lower gene score that did not resolve over time (Extended 
Data Fig. 7e), suggesting that markers of progenitors and differenti- 
ated cells might be co-expressed (Fig. 2b). We plotted the normalized 
counts for each gene that discriminated neurons from radial glia in 
primary samples, finding that neurons had low expression of radial glia 
markers, and radial glia did not express neuronal markers. However, 
we found substantial co-expression of these markers in organoid cells, 
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level (****P=5.34 x 10°). Welch’s two-sided t-test; primary, n=5; organoids, 
n=37).Dot plot depicts genes that discriminate between radial gliaand neuron 
identity in primary samples. Each dot isa gene, shownas the average 
expression in radial glia (x-axis) and neuron (y-axis) for primary (orange) or 
organoid (blue) cells. c, Differential expression (two-tailed Wilcoxon rank sum 
test) between clusters annotated as oRG cells in primary and organoid datasets 
generated log,(fold change) (x-axis) and —log,)(adjusted P) (y-axis). Primary, 
n=5; organoids, n=37). A pseudocount of 500 was assigned to comparisons 
with an adjusted P= 0. Many measurements were significant, including 
expression of the oRG identity gene PTPRZ1. Week-8 organoids had minimal co- 
expression of PTPRZ1 and HOPxX (top), while at GW15 the outer subventricular 
zone (oSVZ) contains extensive co-localization (repeated independently 3x). 
White arrows, double-positive cells; yellow arrows, single-positive cells. Scale 
bar, 50 um. d, Differential expression (two-tailed Wilcoxon test) between cell 
clusters annotated as upper layer neurons. 


resulting in a lower correspondence between organoid and primary 
cell types and subtypes. 

We explored how well organoid radial glia recapitulated their pri- 
mary cell subtype counterparts at the transcriptomic level by focusing 
the comparison on oRG cells. Anumber of genes were more highly 
expressed in organoid oRGs than in primary oRGs, and these genes 
were largely related to glycolysis or endoplasmic reticulum (ER) stress 
(Supplementary Table 7). One of the genes that was most highly 
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Fig. 3 | Maturation of cortical lamina and radial glia. a, 
Immunohistochemistry of SOX2* progenitors, HOPX* oRG cells, CTIP2* deep 
layer neurons and TBR2’ IPCs in primary and organoid samples showing 
laminar structure during neurogenesis. Primary samples express SOX2 and 
TBR2in the ventricular zone and CTIP2 inthe cortical plate at GW13. By GW15, 
HOPX’ oRGs are born and reside in the oSVZ. The cortex expands markedly over 
the following weeks with more HOPX’ oRG cells residing in the oSVZ, providing 
ascaffold on which neurons migrate. Organoids express similar markers to 
GWI13 samples by week five of differentiation with multiple ventricular zone- 
like structures. oRG cells arise and increase between weeks eight and ten. The 
radial architecture expands and dissolves over this period. By week 15, a mix of 
cell types is present in the organoid. Organoids shown were differentiated 
using the ‘least directed’ differentiation protocol and staining was validated 
independently three times (primary: n=4 biologically independent samples; 
organoid: n=3 biologically independent samples). Scale bar, 50 uM. b, 
Pseudoage was calculated by identifying networks froma10,000-cell subset of 
primary radial glia that highly correlated to age (either positively or 
negatively). These networks were then collapsed into a single ‘age network’. 
The module eigengene for this age network was then calculated onthe 
remaining data and used for pseudoage. Pseudoage is indicated by the graph 
line and shading represents the geometric density standard error of the 
regression. The primary dataset (orange) has a high Pearson’s correlation and 
R? value, while the organoid dataset has no correlation to the pseudoage 
metric. 


upregulated in primary oRG cells, but had very low expression in orga- 
noid oRG cells, was PTPRZ1, aknown oRG marker” (Fig. 2c). To validate 
this finding, we stained primary and organoid samples for PTPRZ1 
and HOPX, a canonical oRG marker”, and found that primary tissue 
contained more HOPX and PTPRZ1 co-expressing cells than did orga- 
noids (Fig. 2c, Extended Data Fig. 9e). We performed similar differential 
expression analysis between upper layer neuron clusters, and found 
that two genes required for neuronal maturation and projection pattern 
specification, MEF2C and SATB2*, were substantially upregulated only 
in primary cells (Fig. 2d). Even when cellular subtypes can be assigned 
to organoids, they lack molecular subtype identifiers. 


Cell maturation is impaired in organoids 

The progression of developmental events, such as the birth of neu- 
ronal and glial cell types, occurs more rapidly in organoids than in 
primary tissue, and progenitor and neuronal zones do not expand as 
broadly as in vivo (Fig. 3a). A primitive radial glial scaffold is observed 


at 5 weeks of differentiation in organoids, whereas the oRG scaffold 
expands predominantly after 15 weeks of development inthe primary 
human cortex. Over the course of 15 weeks of differentiation, the orga- 
noid progenitors differentiate, the ‘scaffold’ dissolves and intermixed 
populations of neurons and glia develop. Using our transcriptomic data, 
we sought to explore how cellular maturation was affected as a result 
of the faster temporal development we observed cytoarchitecturally 
in organoids. To explore the maturation of progenitor cells, we used 
weighted gene co-expression network analysis (WGCNA”*) to generate 
gene modules of strongly correlated genes in primary radial glia. We 
consolidated networks that correlated with sample age into a pseudo- 
age metric and then correlated pseudoage with actual age, observing 
a strong positive correlation in primary radial glia (Fig. 3b, Extended 
Data Fig. 10, Supplementary Table 8). With confidence in our networks, 
we applied them to the organoid radial glia. We saw limited correla- 
tion between organoid pseudoage and actual age, suggesting that the 
molecular maturation programs that exist in vivo are not activated 
in organoids. Notably, this heterogeneous maturation level existed 
within each organoid (Extended Data Fig. 10c), indicating that there is 
variability between individual cells and notjust across organoids, lines 
or batches. The lack of a radial glia molecular maturation signature in 
organoids correlates with the absence of molecular diversity in this 
model. The effect of radial glia subtype and maturation on the role 
of these cells as neural precursors is unknown, but the dysregulation 
of these programs in organoids may affect their ability to completely 
recapitulate differentiation trajectories of cortical neurons in vivo. 


Definition of cortical areal signatures 


Recent studies have uncovered molecular differences between excita- 
tory neurons across cortical regions”’”’, and these differences may 
emerge during neurogenesis”. Given that regional specification may 
represent a central feature of neuronal identity, we investigated how 
the molecular properties of areal identity emerge. We leveraged pri- 
mary cell data collected from seven cortical regions. For genes that 
were uniquely enriched in each region, we calculated a weighted 
average expression (eigengene) across primary and organoid cells 
(Supplementary Table 9). In primary cells, some signatures, such as 
those from the PFC, temporal lobe, hippocampus and V1, were highly 
enriched in their respective areas (Extended Data Fig. 11). Notably, the 
parietal lobe tracked closely with the temporal lobe and the somatosen- 
sory and motor cortex co-expressed signatures, suggesting a lack of 
areal segregation between these regions at the time points sampled 
(Extended Data Fig. 11c). The earliest samples in our dataset preceded 
the development of anatomical distinctions between cortical regions, 
and thus could not be subdissected. The early ‘telencephalon’ samples 
were highly enriched for V1 signatures, but additional work is required 
to clarify whether excitatory neurons born early in development all 
begin by expressing V1 areal genes, or if this was a sampling artefact 
of our dissections. These data offer a new categorization of cortical 
area signatures and enable us to evaluate the areal identities of corti- 
cal organoid neurons. 


Areal signatures reflected in organoids 

Our analysis indicates that many aspects of neuronal subtype are not 
preserved or are averaged into a pan-neuronal identity in organoids. 
However, our primary data suggest that areal identity is an early marker 
of neuronal differentiation. Using areal signatures from primary cells, 
we were able to evaluate the closest areal identity for each excitatory 
organoid neuron profiled by scRNA-seq. We were surprised to discover 
that most neurons corresponded toa defined areal signature (Fig. 4b) 
despite the lack of thalamic input, whichis thought to refine areal iden- 
tity°?!, Although each organoid contained neurons with multiple areal 
identities, the strength of areal correspondence of organoid excitatory 
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Fig. 4 | Analysis of areal identity in organoid excitatory neurons. a, Each of 
seven cortical areas was used to generate a unique area gene signature by 
comparing expression with the other six areas. The unique signatures were 
considered networks, and module eigengenes across area networks were 
calculated for each primary and organoid cell. The area with the highest 
normalized eigengene (normalized to the highest score within each area for 
equal comparison) was designated as the areal identity of that cell. b, Cortical 
composition for organoids across protocols. Areal identity was assigned for 
each cell within an organoid and the areal composition is shown for the 37 
organoids in our dataset. Organoid samples are listed from earliest to latest 
stage collected (weeks 3-24). Within atime point, the organoid protocol used is 
ordered from least to most directed differentiation; each time pointis 
comprised of multiple PSC lines. Every organoid has heterogeneous areal 
expression.c, The average module eigengene score for each primary (orange) 
and organoid (blue) cell designated (primary) or assigned (organoid) PFC or V1 
identity (primary, n=5 independent samples across 5 experiments; organoid, 
n=37 organoids across 4 independent experiments). The average value for PFC 
was not significantly different between organoid and primary, and the V1 
organoid cells had higher correlation to the V1 signature than primary cells, 
indicating that areal identity in the organoid strongly resembles normal 
development (box plots: centre line shows mean, box limits show range and 
whiskers show standard deviation; two-sided Welch’s t-test, P=0).d, Validation 
of intermixing of areal identities in organoid samples differentiated using the 
least directed differentiation protocol. Inthe PFC, BCL11B and SATB2 co- 
localize in the same cell, whereas in V1 cells they are mutually exclusive. Both 
patterns are in close proximity in the organoid. AUTS2 is arostrally expressed 
transcription factor whereas NR2F1is a caudally expressed factor, but they are 
adjacent in the organoid. Scale bar, 50 pM; representative image shown (n=3 
replicates each). 
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neurons was robust, including to regions such as PFC and V1 (Fig. 4c, 
Extended Data Fig. 11d). Regardless of the PSC line or differentiation 
protocol used, cortical organoids comprised heterogeneous areal 
identities. To investigate whether cells that corresponded to different 
areas were spatially segregated within an organoid, we performed 
immunohistochemistry for two sets of area-specific genes. PFC excita- 
tory neurons co-express the projection-specification transcription 
factors SATB2 and BCL11B (also known as CTIP2), and througha narrow 
topographical transition these markers segregate entirely in V1’. We 
explored the expression of these factors in our organoids, and observed 
both co-expression and segregation of SATB2 and CTIP2 in adjacent 
cells (Fig. 4d). AUTS2 and NR2F1 are well-described genes with rostral— 
caudal gradient expression patterns, and we similarly observed cells 
expressing either of these factors in proximal space. Together, these 
data suggest a model in which differentiation of cortical excitatory 
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neurons is strongly defined by areal identity, and organoids recapitulate 
this process, but without spatial organization. 


Cellular stress increases in organoids 


Modules related to the activation of glycolysis and ER stress are enriched 
in organoid cells‘, and additional analysis using four orthogonal 
co-clustering methods showed that stress pathways were upregulated 
in organoids across all protocols (Extended Data Figs. 12, 13). We con- 
firmed that several genes that were upregulated in organoid datasets’, 
including the glycolysis gene PGK1™ and the ER stress genes ARCNI1™ 
and GORASP2***>, were enriched at the protein level in organoids 
(Extended Data Fig. 12). Immunostaining verified that, regardless of the 
stage of organoid differentiation or PSC line used, there was increased 
expression of PGK1, ARCN1and GORASP2 in distinct organoid domains, 
not restricted to the organoid core. 

To probe the origin of this cellular dysregulation, we first evaluated 
the expression of stress genes during normal human cortical develop- 
ment. Fixed cryosectioned tissue samples showed little expression of 
these genes throughout peak neurogenesis (Extended Data Fig. 12c), 
thoughsome ER stress was observed at earlier cortical stages (Extended 
Data Fig. 13b, c). As ER stress and glycolysis genes are not canonically 
activated during cortical development, we hypothesized that the 
in vitro conditions of the organoid model resulted in increased cellu- 
lar stress. We first evaluated the activation of stress pathways in PSCs 
and were surprised to find expression of both ARCN1 and GORASP2, 
suggesting that ER stress occurs in stem cells before organoid forma- 
tion (Extended Data Fig. 12b). To determine the rate at which cellular 
stress arises in vitro, we cultured organotypic cortical slices for one 
week and observed negligible change in stress activation compared 
with acutely fixed samples. However, we did observe upregulation 
of ARCN1 and GORASP2 in primary dissociated cells after one week 
in culture (Extended Data Figs. 12, 13), suggesting that cellular stress 
may bea broader feature of in vitro culture. 


Organoid environment activates stress 


To test whether aggregate cell-culture conditions induced 
cellular stress, we transplanted GFP-labelled primary progenitors from 
GW14~-20 into organoids. After 2.5 weeks we observed GFP-labelled 
SOX2* HOPX* primary radial glia within the organoids (Fig. 5b, Extended 
Data Fig. 14). We isolated GFP* primary cells and performed scRNA-seq 
to compare pre- and post- transplanted cells to our primary reference 
dataset (Supplementary Table 10). We found a marked increase in the 
expression of the glycolysis gene PGK/ and the ER stress gene GORASP2 
in primary cells transplanted into, or generated within, organoids 
(Fig. 5c, Supplementary Table 11). 


Increased stress impairs cell subtype 


Mouse knockout studies have suggested that the activation of ER stress 
pathways can inhibit cell-type specification*®”’, so we investigated 
whether metabolic stress affected specification in transplanted primary 
cells. We noted similar subtypes when we compared pre-transplanta- 
tion cell clusters and our primary reference data. By contrast, post- 
transplanted primary cells had significantly lower subtype correlation, 
similar to organoid cells, and they lacked markers of specific progenitor 
or neuronal subtypes (Fig. 5b, Extended Data Fig. 14e). To test whether 
the differences were driven by the induction of stress pathways, we 
also generated 3D aggregates of dissociated primary cortical cells 
from GW14/15 and found a similar upregulation of metabolic stress 
genes (Extended Data Fig. 13e). However, we observed an intermediate 
phenotype in which subtype correlation was significantly (P= 0.0037) 
higher in primary aggregates than in post-transplanted primary cells, 
but still significantly lower than in pre-transplant cells (Extended Data 
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Fig. 5| Influence of culture on metabolic stress and cell type. a, Primary 
samples were progenitor-enriched, GFP-labelled and transplanted into 
organoids. After 2.5 weeks, GFP’ cells were isolated by fluorescence-activated 
cell sorting (FACS) and processed for scRNA-seq. IHC, immunohistochemistry. 
b, Primary GFP’ cells integrate into organoids differentiated using the least 
directed protocol. Scale bar, 50 uM. Pre-transplantation cells have similar 
profiles and cellular subtypes as primary data. After transplantation, thereisa 
decrease in subtype correlation (n=7 biologically independent samples across 
2 independent experiments). Mean +s.d. subtype correlation indicated on 
graph (P=2.8 x 107°, two-sided Welch’s f-test).c, After transplantation, primary 
cells show increased expression of the stress genes PGK1 (****P=1.76 x10*”, 
two-sided Student’s t-test) and GORASP2 (arrow; ****P= 9.60 x 10°, two-sided 
Student’s t-test) as indicated by width of coloured domain in each respective 
violin plot (n=7 samples across 2 experiments). Scale bar, 50 1M. d, Organoids 
were dissociated, GFP-labelled and injected into the cortex of P4 mice. After 


Fig. 14e). Some of these discrepancies might be attributable to the 
presence of other cell types, including microglia, endothelial cells 
and pericytes in the primary aggregate, that may promote normal 
maturation and differentiation. 


Transplantation rescues cell stress 

To determine whether an in vivo environment could rescue the cellular 
stress derived from organoid culture conditions, week-8 organoids 
were dissociated, virally labelled with GFP and transplanted into the 
cortices of postnatal day four (P4) mice (Fig. 5d). Two or five weeks 
after transplantation, organoid-derived cells could be visualized incor- 
porated into the mouse cortex (Fig. 5e, Extended Data Fig. 14f). After 
five weeks, organoid-derived cells had intricate morphologies and 
showed reduced expression of cellular stress markers. The glycolysis 
gene PGK1 and the ER stress gene ARCNI were not expressed, andtheER 
stress gene GORASP2 showed reduced expression compared to normal 
organoid conditions (Fig. 5f, Extended Data Fig. 14g). As organoid cells 
showed reduced stress after transplantation, we evaluated whether 
organoid-derived cells were capable of higher subtype specificity when 
removed from the in vitro environment. We isolated GFP* organoid cells 
two or five weeks after transplantation for scRNA-seq and compared 
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2-5 weeks, mouse brains were collected for scRNA-seq and immunostaining. e, 
Human cells are visualized by GFP and human nuclear antigen expression 
(n=13 mice transplanted with 14 organoids derived from 2 induced PSC lines 
across 2 independent experiments). Organoid cells express markers of 
progenitors (HOPX), neurons (CTIP2 and SATB2) and astrocytes (GFAP). Mouse 
vascular cells (laminin and CD31) innervate the transplant. f, Post- 
transplantation, organoid cells show reduced expression of stress genes. 
Scatter plots show decreased staining intensity in transplanted organoids 
(error bars, s.d.) of PGK1 (****P= 9.64 x 10°”), ARCNI (****P=1.28 x 10°) and 
GORASP2 (****P=1.88 x 10°; n=19 sections from 6 transplanted mice across 2 
experiments, each marker stained independently; two-sided Student’s ¢-test). 
Violin plots show a decrease in PGK1 (****P= 2.16 x 10”) and GORSASP2 

(**P= 0.0019) expression in organoid cells post-transplant from single-cell 
analysis (n=1,980 cells from 7 transplanted mice across 2 experiments, two- 
sided Welch’s f-test). 


pre- and post-transplantation organoid-derived cells to our primary 
reference. We noted increased cell subtype specification of both oRG 
cells and newborn neurons (Extended Data Fig. 14h), suggesting that 
metabolic stress contributes to specification deficiencies in organoid 
cells (Supplementary Discussion). 


Conclusions 

We have provided a comprehensive molecular characterization of 
developing human cortical cell types and their preservation in brain 
organoid models. Using single-cell transcriptomics, we have identified 
broad cell classes and types, as well as fine-grained subtypes such as 
outer radial glia progenitors in primary human samples. Compared to 
primary tissue, organoids contain a smaller number of cell subtypes 
and their cells often co-express marker genes, resulting in broad type 
assignment, such as pan-radial glia or pan-neuron. We have used this 
dataset to generate pseudoage metrics and provide in-depth analysis 
of area-specific gene signatures and their developmental trajectories 
in primary and organoid neurons. Finally, we have identified a role 
for stress pathway activation in the impaired subtype specification of 
cortical organoid cell types; the lack of specificity in organoids must 
be carefully considered when studying developmental processes, 
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cell-type-specific disease phenotypes or cellular connectivity. In addi- 
tion, metabolic stress in utero could lead to molecular identity changes, 
with potential consequences for human brain development. Overall, 
our compilation of raw and analysed data, paired with visualization of 
single-cell clustering ina cell browser, provides a valuable resource for 
better understanding of normal human development and to benchmark 
the fidelity of in vitro cellular data. 
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Methods 


No statistical methods were used to predetermine sample size. 
The experiments were not randomized and the investigators were not 
blinded to allocation during experiments and outcome assessment. 


PSC expansion culture 

The human induced PSC lines H28126 (Gilad Laboratory, University of 
Chicago), 13234 and WTC10 (Conklin Laboratory, Gladstone Institutes), 
which were previously authenticated*, and the embryonic stem cell 
line H1 (WiCell, authenticated at source), were expanded on matrigel- 
coated six-well plates. Cells tested negative for mycoplasma. Stem 
cells were thawed in StemFlex Pro Medium (Gibco) containing 10 uM 
Rock inhibitor Y-27632. Medium was changed every other day and lines 
were passaged when colonies reached about 70% confluency. Stem 
cells were passaged using PBS-EDTA and residual cells were manually 
lifted with cell lifters (Fisher). All lines used for this study were between 
passage 25 and 40. 


Cortical organoid differentiation protocols 

Cortical organoids were differentiated using three directed differentia- 
tion protocols. In brief, PSC lines were expanded and dissociated to sin- 
gle cells using Accutase. After dissociation, cells were reconstituted in 
neural induction medium at 10,000 cells per well in a 96-well v-bottom 
low-adhesion plate. After 18 days, organoids from all protocols were 
transferred from 96-well to 6-welllow-adhesion plates and moved onto 
an orbital shaker rotating at 90 rpm. Throughout the culture duration 
organoids were fed every other day. Organoids were collected for immu- 
nohistochemistry and scRNA-seq after 3, 5, 8 or 10 weeks of culture. 

For the least directed differentiation protocol', GMEM-based 
induction medium included 20% knockout serum replacer (KSR), 1x 
non-essential amino acids, 0.11 mg/ml sodium pyruvate, 1x penicil- 
lin-streptomycin, 0.1 mM B-mercaptoethanol, 5 uM SB431542 and 3 
LMIWRI1-endo. Medium was supplemented with 20 pM Rock inhibitor 
Y-27632 for the first 6 days. After 18 days, the medium was changed to 
DMEM/F12 medium containing 1x glutamax, 1x N2, 1x CD lipid concen- 
trate and 1x penicillin-streptomycin. After 35 days, organoids were 
moved into DMEM/F12-based medium containing 10% FBS, 5 pg/ml 
heparin, 1x N2, 1x CD lipid concentrate and 0.5% matrigel (BD). After 
70 days, the medium was additionally supplemented with 1x B27 and 
the matrigel concentration was increased to 1%. 

In the directed differentiation protocol*®, the induction medium 
consisted of GMEM including 20% KSR, 1x non-essential amino acids, 
0.11 mg/ml sodium pyruvate, 1x penicillin-streptomycin and 0.1 mM 
B-mercaptoethanol supplemented with 5 uM SB431542, 3 uM IWRI1- 
endo and 2 uM dorsomorphin. From days 9 to 25 small molecules were 
removed and the induction medium was instead supplemented with 
10 ng/ml EGF and 10 ng/ml FGF. After 25 days the medium was changed 
to DMEM/F12 medium containing 1x glutamax, 1x N2, 1x CD lipid con- 
centrate and 1x penicillin-streptomycin. After 35 days, organoids were 
moved into DMEM/F12-based medium containing 10% FBS, 5 pg/ml 
heparin, 1x N2, 1x CD lipid concentrate and 0.5% matrigel (BD). After 
70 days, the medium was additionally supplemented with 1x B27 and 
the matrigel concentration was increased to 1%. 

The most directed” protocol used a DMEM/F12-based induction 
medium containing 15% KSR, 1x MEM-NEAA, 1x glutamax, 100 uM B-ME, 
100 nMLDN-1931839, 10 LM SB431542, and 2 uM XAV939. For the first 2 
days, the medium was supplemented with 50 uM Rock inhibitor Y-27632 
and 5% heat-inactivated FBS. After 10 days, organoids were moved into 
neuronal differentiation medium consisting of equal parts DMEM/F12 
and neurobasal medium containing 0.5% N2, 1% B27 without vitamin A, 
1% glutamax, 0.5% MEM-NEAA, 0.025% human insulin solution, 50 uM 
B-ME and 1% penicillin-streptomycin. After 18 days, organoids were 
maintained in maturation medium containing equal parts DMEM/F12 
and neurobasal medium with 0.5% N2, 1% B27, 1% glutamax, 0.5% NEAA, 


0.025% human insulin solution, 50 uM B-ME, 20 ng/ml BDNF, 200 uM 
cAMP and 200 uM ascorbic acid. 


Immunohistochemistry 

Cortical organoids and primary human cortical tissue samples were 
collected, fixed in 4% PFA, washed with 1x PBS and submerged in 30% 
sucrose in 1x PBS until saturated. Samples were embedded in cryomolds 
containing 50% OCT (Tissue-tek) and 50% of 30% sucrose in 1x PBS 
and frozen at —80 °C. Primary samples were sectioned at 20 uM and 
organoids at 16 uM onto glass slides. Antigen retrieval was performed 
on tissue sections using a citrate-based antigen retrieval solution at 
100x (Vector Labs) which was boiled to 95 °C and added to slides for 
20 min. After antigen retrieval, slides were briefly washed with PBS and 
blocked with PBS containing 5% donkey serum, 2% gelatin and 0.1% 
Triton for 30 min. Primary antibodies were incubated in blocking buffer 
on slides at 4 °C overnight, washed with PBS containing 0.1% Triton 
three times and then incubated with AlexaFluor secondary antibod- 
ies (Thermo Fisher) at room temperature for 2 h. Primary antibodies 
included mouse: SOX2 (Santa Cruz, 1:500, sc-365823), HOPX (Santa 
Cruz, 1:250, sc-398703), SATB2 (Abcam, 1:250, ab51502), AUTS2 (Abcam, 
1:100, ab243036), human nuclei (Millipore, 1:500, MAB1281); rabbit: 
HOPx (Proteintech, 1:500, 11419-1-AP), GORASP2 (Proteintech, 1:50, 
10598-1-AP), ARCNI1 (Proteintech, 1:50, 23843-1-AP), PGK1 (Thermo 
Fisher 1:50, PAS-13863), PTPRZ1 (Atlas, 1:250, HPAO15103), NR2F1 
(Novus, 1:100, NBP1-31259); rat: CTIP2 (Abcam, 1:500, ab18465); sheep: 
EOMES (R&D, 1:200, AF6166); guinea pig: NEUN (Millipore, 1:500, 
ABN9O); and chicken: GFP (Aves, 1:500, GFP-1020). 


Primary sample collection 

All primary tissue was obtained and processed as approved by the UCSF 
Human Gamete, Embryo and Stem Cell Research Committee (GESCR, 
approval 10-05113). Allexperiments were performed in accordance with 
protocol guidelines. Informed consent was obtained before sample 
collection for the use of all tissue samples within this study. First and 
second trimester human cortex tissue was collected from elective 
pregnancy termination specimens from San Francisco General Hospital 
and the Human Developmental Biology Resource (HDBR). Tissue was 
collected only with previous patient consent for research and in strict 
observation of legal and institutional ethical regulations. 


Dissociation 

Primary human cortical samples were dissociated using papain 
(Worthington) containing DNase. Samples were grossly chopped and 
then placed in 1 ml papain and incubated at 37 °C for 15 min. Samples 
were inverted three times and incubation continued for another 15 
min. Next, samples were triturated by manually pipetting with a glass 
pasteur pipette approximately ten times. Dissociated cells were spun 
down at 300g for 5 min and papain removed. 


10x capture and sequencing 

Single-cell capture from live cells was performed following the 10x 
v2 Chromium manufacturer’s instructions for both primary and 
organoid samples. For primary samples, each sample was its own 
batch. For organoid samples, batch is indicated inthe metadata anno- 
tation in Supplementary Table 1. In each case, 10,000 cells were tar- 
geted for capture and 12 cycles of amplification for each of the cDNA 
amplification and library amplification were performed. Libraries 
were sequenced as per manufacturer recommendation ona NovaSeq 
S2 flow cell. 


Clustering 

We first explored the cell-type identities of primary and organoid sam- 
ples using Louvain-Jaccard clustering’’*®. Prior to clustering, batch 
correction was performed in a similar way to previous approaches”. 
In brief, each set of cells within a batch was normalized to the 
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highest expressing gene, making the range of expression from O to 1. 
These values were multiplied by the average number of counts within 
the batch. These normalized datasets were piped into Seurat v.2*°, 
in which cells with fewer than 500 genes per cell or more than 10% of 
reads aligning to mitochondrial genes were discarded. Normalized 
counts matrices were log,-transformed, and variable genes were 
calculated using default Seurat parameters. Data were scaled in the 
space of these variables, and the batch was regressed out. Principal 
component analysis was performed using FastPCA, and significant 
principal components were identified using a published formula’®. In 
the space of these significant principal components, the k=10 nearest 
neighbours were identified as per the RANNR package. The distances 
between these neighbours were weighted by their Jaccard distance, 
and Louvain clustering was performed using the igraph R package. 
If any clusters contained only one cell, the process was repeated with 
k=11and upwards until no clusters contained only one cell. Cluster 
markers and t-SNE plots were generated with Seurat package default 
parameters. 


Cell-type annotations 

Primary cell-type annotations of clusters were performed by com- 
parison to previously annotated cell types, and when a repository of 
substantial matching was not available, a combination of literature- 
based annotation of layer or maturation stage identity was used. The 
genes used to annotate each cluster are highlighted in Supplementary 
Table 2. When a cluster was substantially enriched based upon an age 
or an areal metadata property, this empirical observation was used to 
inform the annotation. Organoid cell types were first annotated by their 
similarity to primary cell clusters; if the correspondence was at or above 
0.4 and only one primary cell type had sucha high correspondence, 
the primary cell type was applied to the organoid cluster. If the corre- 
spondence was between 0.2 and 0.4 and included only one similarity, 
that cell type was used to identify the organoid cell type unless there 
was an obvious discrepancy in top marker gene expression between 
the two clusters. Ifno correlation was above 0.2, literature annotations 
or unknown identities were assigned. If an organoid cluster correlated 
equally well (within 10%) with multiple primary subtypes of the same 
or similar cell type, ‘pan’ identity was assigned. Low-quality cell types 
for all analyses were assigned when markers were dominated (>60%) by 
mitochondrial genes, ribosomal genes, or pseudogenes. Occasionally, 
anintersection of these approaches was used for organoid clusters, as 
indicated in Supplementary Table 3. 


Correlation analysis 

Correlation analysis was generated in the space of marker genes. For 
each marker gene, a specificity score was calculated. This score equalled 
the ‘enrichment’ (log,(fold change) of the marker compared to other 
clusters) and the ‘specificity’ (the percentage of the relevant cluster 
expressing the marker divided by the percentage of other clusters 
expressing the marker). These two values were multiplied by one 
another to obtain the final score, and this was represented across all 
marker genes for each sample in box-and-whisker plots. A matrix of 
all markers across all clusters was created for each individual dataset; 
ifa marker was not expressed at all in a certain cluster, it was marked 
as O. If a value was divided by 0 to calculate the score, the score was 
placed as adummy score at 1,500. Matrices between comparisons 
were correlated in the space of overlapping marker gene space using 
Pearson’s correlations. 


Co-clustering analysis 

Each of the five batch correction methods was performed with 
default parameters. For each analysis, the same 20,000 cell subset of 
each organoid and primary cells was used because most of the algo- 
rithms were too computationally intensive to perform on the full 
dataset. 


Linear mixed models 

VariancePartition“ was used for linear mixed model analysis. Analysis 
was performed in a randomized subset of 50,000 genes in the space 
of expressed genes across the metadata properties noted in Extended 
Data Fig. 9. Age was used as a continuous variable and all other variables 
were assigned as discrete. 


WGCNA and maturation analysis 

WGCNA networks were calculated as previously described” in 10,000 
randomly chosen primary radial glia cells and in parallel from 10,000 
randomly chosen organoid radial glia cells. These networks were 
applied to the remaining primary and organoid cells using the Modu- 
leEigengene function from the WGCNAR package. Pseudoage was cal- 
culated by taking networks that correlated highly to age inthe 10,000 
cell subset and combining their genes into a single gene set. Principal 
component analysis was performed in this gene space in the full space 
of radial glia and the loading of the first principal component dictated 
the pseudoage. This analysis was performed reciprocally. 


Area signatures 

Area signatures were obtained by performing pairwise differential 
expression between each of the seven cortical areas and the six remain- 
ing areas. Differential expression across all of the areas was combined, 
with a count of how many times a gene was differentially expressed 
in an area from each of the pairwise comparison. While combining 
the lists, the enrichment and specificity were averaged across all six 
analyses and multiplied by the number of times the gene appeared asa 
marker for an area of interest. This value, the ‘area specificity score’, was 
compared across all areas. For any genes that were considered markers 
of multiple areas, the area with the highest area specificity score was 
allocated to the gene asa marker, thus making all area markers unique 
to one area alone. This is how some areas have a higher percentage 
of cells assigned to another area other than their area or origin, and 
enables cleaner comparison of areal pattern emergence. Each set of 
area marker genes were designated as a network, and the correlation 
of each cell to this area was calculated by applyModules and calculating 
amodule eigengene. After assignment, to normalize unequal module 
eigengene distributions, within a dataset the module eigengenes were 
normalized by area and the assigned area for a cell was the area for 
which that cell had the highest module eigengene. 


PSA-NCAM protocol and viral infection of primary cells 

Primary cortical samples were grossly dissected to isolate the 
ventricular and subventricular zones excluding cortical plate neurons. 
Dissociated cells were enriched for neural progenitor cells using a PSA- 
NCAM antibody and the MACs magnetic sorting kit (Miltenyi Biotech). 
In brief, dissociated cells were incubated with the PSA-NCAM antibody 
for 30 min at room temperatre, washed with PBS containing 0.1% BSA 
and added to an equilibrated magnetic L column. Cells positive for 
antibodies bind to the column and negative cells are collected in the 
elute. The negatively sorted cells were pelleted at 300g for 1Omins 
and supernatant removed. Negatively sorted samples were infected 
with a CMV::GFP adenovirus (Vector Biolabs), which preferentially 
labels progenitors, at 37 °C for 15 min. Cells were spun down for 5 min 
at 300g and reconstituted in 500 pl medium. Cells were counted and 
50,000 primary cells isolated for transplantation in 100 pl medium. 


Primary cell transplantation into organoids 

Week-10 or -15 organoids made from the 13234 induced PSC line using 
the least directed differentiation protocol were placed at the air-liquid 
interface on Millicell (Millipore) inserts to limit movement. Fifty thou- 
sand primary cortical cells were reconstituted in medium containing 
DMEM/F12, 5% FBS, 1x N2 (Thermo Fisher), 1x B27 (Thermo Fisher), 1x 
penicillin-streptomycin (Thermo Fisher) and CD lipid concentrate 


(Thermo Fisher). Cells were slowly pipetted on top of the semi-dry 
organoids and left to integrate into the organoids for 30 min at 37 °C. 
Afterwards, organoids were gently lifted off of inserts by increasing 
medium volume by 3 ml. After 2 days, organoids were transferred toa 
new 6-well plate without inserts and the medium supplemented with 
1% GF-reduced matrigel and 1x amphotericin B. 


FACS purification of GFP’ cells 

Cells were dissociated into a single-cell suspension as described above. 
Cell suspensions were triturated and placed ontop of 4 ml 22% Percoll. 
Tubes containing Percoll and cell suspension were spun at 500g for 10 
min without break. The supernatant was discarded, and the cell pellet 
resuspended in HBSS with BSA and glucose. Cells were sorted using a 
Becton Dickinson FACSAria using 13 psi pressure and a 100-m noz- 
zle aperture. All FACS gates were set using unlabelled cells. Data were 
analysed post hoc for enrichment percentages with FlowJo software. 


Maintenance of transplanted organoids 

Organoids were maintained in 6-well low-adhesion plates in DMEM/12 
with glutamax (Thermo) medium containing 10% FBS (Hyclone), 1% 
GF-reduced matrigel (Corning), 1x N2 (Thermo Fisher), 1x B27 (Thermo 
Fisher), 1x CD lipid concentrate (Thermo Fisher), 5 pg/ml heparin, 1x 
penicillin-streptomycin, and 1x amphotericin B (Gibco). The medium 
was changed every other day for the duration of culture. Transplants were 
collected for immunohistochemistry at weeks 1, 2.5, 4. and 6 post-trans- 
plant. At week 2.5, paired organoid samples were FACS-sorted for GFP+ 
cells and captured cells were collected for single-cell RNA sequencing. 


Organoid cell transplantation into mice 

Mouse experiments were approved by UCSF Institutional Animal Care 
and Use Committee (IAUCUC) protocol AN178775-01 and performedin 
accordance with relevant institutional guidelines. Organoids fromthe 
H28126 and 13234 induced PSC lines were differentiated using the least 
directed protocol described above. Week-7/8 organoids were dissoci- 
ated and labelled with a CMV::GFP adenovirus for 30 min at 37 °C. Cells 
were pelleted and immediately transplanted into postnatal NSG mice 
(NOD.Cg-Prkdc@ Il2rg'™™""/SzJ, stock No:005557) at four days of age 
(P4). Using a stereotaxic rig, either the PFC or V1 of the left hemisphere 
was localized and 10,000 cells were transplanted into the cortex at each 
injection site. Two transplantations or injections, 0.5 mm apart, were 
made per mouse within the same cortical area of 10,000 cells each. Mice 
were allowed to develop for either 2 or 5 weeks post-transplantation 
before being euthanized. A total of 13 mice were used (7 male, 6 female), 
and nostatistical method was used to determine this sample size. Mice 
were euthanized, and the brain was extracted and grossly dissected 
using GFP expression to visualize relevant areas for collection. Tissue 
with GFP expression was dissociated for 30 min at 37 °C using papain, 
manually triturated, centrifuged and resuspended in HBSS. Cells were 
sorted for GFP using the FACS strategy described previously. After 
FACS isolation, GFP* cells were used for scRNA-seq using the 10x V2 
platform. Mice from the same experiment were collected in parallel 
and perfused with 10 ml PBS followed by 10 ml 4% PFA before the brains 
were extracted. Mouse brains were further fixed overnight at 4 °C, 
washed in 1x PBS three times for 30 min each, and rocked overnight at 
4°Cin30% sucrose in PBS. Brains were embedded in 50/50 mix of 30% 
sucrose and OCT before being cryosectioned. Mouse studies were not 
randomized or blinded prior to analysis. 


Organotypic slice culture 

Primary cortical tissue was maintained in CO,-bubbled artificial cer- 
ebral spinal fluid until being embedded in a 3% low melt agarose gel. 
Embedded tissue was live-sectioned at 300 um using a vibratome 
(Leica) and plated on Millicell (Millipore) inserts in a 6-well tissue cul- 
ture plate. Slices were cultured at the air-liquid interface in medium 
containing 32% Hanks BSS, 60% BME, 5% FBS, 1% glucose, 1% N2 and 


1% penicillin-streptomycin-glutamine. Slices were maintained for 7 
days in culture at 37 °C and the medium was changed every third day. 


Primary cortical aggregates 

Primary human cortical samples from gestational weeks 14 and 15 were 
gross-dissected at the outer subventricular zone, removing the cortical 
plate. Samples were dissociated using papain as described previously. 
Samples were aggregated in 96 v-bottom low-adhesion plates (S bio) 
containing 20,000 cells per well. They were aggregated in DMEM/F12 
with Glutamax (Thermo) based medium containing 10% FBS (Hyclone), 
1x N2 (Thermo Fisher), 1x CD lipid concentrate (Thermo Fisher), 5 pg/ 
ml Heparin, 1x penicillin-streptomycin and 1x amphotericin B (Gibco). 
Medium was supplemented with 20 uM Rock inhibitor for the first 
week. After 2 weeks, aggregates were transferred to 6-well low-adhesion 
plates and medium was supplemented with 1% matrigel and 1x B27. 


Dissociated primary cell culture 

Dissociated primary cortical cells were reconstituted in DMEM/F12- 
based medium containing 1x N2 (Thermo Fisher), 1x B27 (Thermo 
Fisher), 1x penicillin-streptomycin (Thermo Fisher) and 1x sodium 
pyruvate (Thermo Fisher). Cells were plated at one million cells per 
ml in 12-well matrigel-coated tissue culture plates. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Single-cell RNA sequencing data have been deposited in dbGAP for 
accession ‘A cellular resolution census of the developing human brain’ 
and in GSE132672. An interactive browser of single-cell data and raw 
and processed count matrices can be found at the UCSC cell browser 
website: https://organoidreportcard.cells.ucsc.edu. Source Data for 
Figs. 1-5 and Extended Data Figs. 1-14 are available online. Remaining 
source data can be retrieved directly from the single-cell data available 
in public repositories or from the UCSC cell browser website. 
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A Comparing Axes of Biological Variation of Development Between Primary Human Cortex and Cortical Organoids 
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Extended Data Fig. 1| Schematic of human cortex and human organoid 
development. a, Schematic of normal brain developmental trajectories 
queried in this study and their comparison to organoid models. Normal 
cortical development requires the emergence of a diversity of progenitor cell 


types froma seemingly uniform neuroepithelium. Through a sequence of cell- 


type specification and maturation, progenitor cells undergo neurogenesis and 
gliogenesis to generate the cellular diversity of the cortex. Areal identities are 
specified during this process and comprise a core property of developing 
neurons. 
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Extended Data Fig. 2| Brain and cortical organoid generation protocols. a, 


Cortical organoid protocols using different levels of directed differentiation 


were evaluated using scRNA-seq and immunohistochemistry. Stem cells were 


expanded on matrigel, dissociated to single cells, and re-aggregated in 
v-bottom low-adhesion plates. Small molecules were used to promote 
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forebrain induction and after 18 days cells were moved to 6-well plates onan 
orbital shaker. Organoids were maintained in culture and collected from weeks 
3 to 24. b, Protocol schematics for other methods used to differentiate whole 
brain and cortical organoids, which have published single-cell data. Publicly 
available data were used for comparative analyses with our collected data. 
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Extended Data Fig. 3 | Comparison of broad cell types across differentiation 
protocols. a, Organoids derived from the 13234 induced PSC line underwent 
the least and directed differentiation protocols, were collected at weeks 5 and 
10, and were processed for immunohistochemistry. Organoids from both 
protocols were stained with SOX2 to mark progenitors, HOPX to identify outer 
radial glia and TBR2 to label intermediate progenitor cells. Cultures were also 
stained with CTIP2 to mark deep layer neurons and SATB2 to identify upper 


layer neurons. At week S all progenitor subtypes were present. and by week 

10 both deep and upper layer neurons were detected. b, Organoids fromthe 
H28126 induced PSC line were differentiated using the least, directed and most 
directed protocols. All progenitor types marked by SOX2, HOPX and TBR2 and 
CTIP2* and SATB2‘ neuronal populations were present by week 5. Expression of 
all markers decreased by week 10. Organoid staining validation of broad cell 
types was repeated independently three times. 
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Clustering Analysis of Cortical Organoids 
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Extended Data Fig. 4| Single-cell comparison of cell types across samples. 
a, t-SNE plots depicting the single-cell analysis of primary cortical cells as 
coloured by cluster, age of sample and cortical area. Stacked histograms 
showing composition of each cluster for these metadata properties are also 
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included. b, ¢-SNE plots depicting the single-cell analysis of cortical organoid 
cells as coloured by cluster, protocol, pluripotent stem cell line (induced PSC or 
human embryonic stem cell) and age of sample. Stacked histograms showing 
composition of each cluster for these metadata properties are also included. 
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Cc Comparison of Published Organoid Datasets to Primary Human Brain 
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Extended Data Fig. 5 | Single-cell comparison of cell types across published correspondence shows the quantitative correlation from the best match in the 


datasets. a, Re-analysis of published single-cell sequencing in organoid heat map for each category of class, state, type and subtype, averaged across 
samples. t-SNE plotis coloured by cell-type designation, and the feature plots all clusters (primary: n=189,409 cells from five individuals collected 

depict the same cell populations as presented in Fig. 1. b, t-SNE plots depicting independently; published organoid data: n=109,813 cells from 7 datasets 

the single-cell analysis of published organoid cells as coloured by cluster, collected independently by different scientific groups; two-sided Welch’s t-test 
protocol (including paper of origin) and FOXG1expression.c,Recapitulationof evaluating mean+s.d.; subtype versus type, * P=0.0193; subtype versus state, 
the heat map in Fig. 2, using published organoid clusters from above and **P= 0.00017). 


comparing to primary reference dataset from this paper. Quantification of 
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Extended Data Fig. 6| See next page for caption. 
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Extended Data Fig. 6 | Single-cell comparison of cell types across published 
datasets. a, Re-analysis of published single-cell sequencing? inwhicha 
reproducible cortical organoid protocol was presented. The ¢-SNE plot is 
coloured by cell-type designation, and the feature plots depict the same cell 
populations as presented in Fig. 1. b, Recapitulation of the heat map in Fig. 2, 
using published organoid clusters*> and comparing to the primary reference 
dataset from this paper. Quantification of correspondence shows the 
quantitative correlation from the best match in the heat map for each category 
of class, state, type and subtype, averaged across all clusters (primary: 
n=189,409 cells from five individuals collected independently; organoid data®: 


n=166,241 cells from an independently collected dataset; two-sided Welch’s 
test was used to evaluate mean +s.d.; subtype versus state ****P=1.8 x10”). 

c, Pseudoage analysis of published organoids’ mirrors the organoids in this 
study with low correspondence between pseudoage and actual age. Pseudoage 
calculation is indicated by the graph line and shading represents the geometric 
density standard error of the regression. d, Areaidentity was assigned for all 
excitatory neurons from ref.* and each organoid consisted of heterogeneous 
areal identities, consistent with the observations in the organoids from this 
study. 
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Extended Data Fig. 7| See next page for caption. 
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Extended Data Fig. 7| Analysis of subtype correlation across metadata 
properties. a, Composition of each organoid by cell-type designation. FOXG1 
expression across all organoid samples is plotted by feature on the right. 

b, Comparison of organoid subtype as determined by this study versus three 
control analyses. Graphically, the column indicates subtype correspondence; 
error bar, s.d. The first analysis was performed by halving the primary dataset 
randomly and without overlap and then comparing the subclusters from the 
two datasets. This age- and method-matched analysis shows that primary 
variation is significantly lower than the variation between organoids and 
primary cells, as indicated by the significantly higher subtype correlation 
between primary datasets (organoids: n= 242,349 cells collected from 37 
organoids from 4 biologically independent samples from 4 independent 
experiments; primary data: 189,409 cells from 5 biologically independent 
samples from5 experiments; ****P = 2.0 x 10 *+, two-sided Welch’s t-test). A 
similar analysis was performed comparing the primary data from this study to 
data collected by microfluidic approaches”. Although the ages, capture 
method and number of cells varied greatly, subtype correlation between the 
published primary data and the data in this study is significantly higher than 
the subtype similarity between organoids and primary samples” (n=4,261 cells 
from 48 biologically independent samples across more than 35 independent 
experiments, ****P=2.0 x 10°). We additionally performed this analysis 
between two published datasets for cells from adult humans, comparing 
middle temporal gyrus* (MTG, n=15,928 cells) from an older adult with 
distinct brain regions from young adults in the control samples of astudy on 
autism spectrum disorder* (ASD, n=104,559). Despite differences across ages 
and individuals, who could be expected to have unique cortical gene- 
expression profiles based upon sensory experience, the distinct cortical 
regions isolated and the different capture methods, the subtype correlation 
between these two primary datasets is significantly higher than the correlation 
between organoid cells and primary cells (**P=0.0076).c, Subtype correlation 


as calculated and shown in Fig. 2, broken down by protocol and pluripotent 
line, in which bars indicate subtype correlation and error bars shows.d. The 
least directed protocol was significantly better at recapitulating cell subtype 
than the most directed protocol (*P= 0.0483, two-sided Welch’s t-test), 
consistent with recent findings®. We also observed that the induced PSC line 
1323_4 generated significantly more similar subtypes to primary samples than 
WTC10 or H1 (**P=0.0013 and 0.00839, respectively). d, Clustering and subtype 
analysis was performed between all organoids and primary PFC samples and 
primary V1individually. Subtype correlation did not change regardless of the 
area to which organoids were compared. ‘Overall’ refers to the subtype 
correlation observed when comparing all organoids cells to all primary cells 
and is shown for comparison. Histogram bars show subtype correlation and 
error bars shows.d. (n=242,349 cells from 37 organoids across 4 independent 
experiments). e, Subtype correlation analysis was performed across all 
organoid stages (n= week 3: 38,417 cells, week 5: 26,787 cells, week 8: 11,023 
cells, week 10: 50,550 cells, week 15: 2,722 cells, week 24: 4,506 cells from4 
independent experiments) and all primary ages (n = GW6: 5,970 cells, GW10: 
7,194 cells, GW14: 14,435 cells, GW18: 78,157 cells, GW22: 83,653 cells from5 
independent experiments). Histogram bars show subtype correlation and 
error bars shows.d. Week-3 organoids are more similar to younger primary 
stages, and week-15 organoids are most similar to older primary ages. Other 
ages correspond similarly well to the primary stages of peak neurogenesis 
(GW10-24), and altogether the organoids are most significantly similar to 
GW14 (**P=0.0015, two-sided Welch’s t-test). ‘Overall’ refers to the subtype 
correlation observed when comparing all organoids cells to all primary cells 
and is shown for comparison. The last histogram shows the average gene score 
of each sample and error bars shows.d. Younger primary samples and 
organoids have a relatively lower gene score related to their marker specificity; 
this specificity increases substantially over time in primary cells but less soin 
organoid cells. 
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Extended Data Fig. 8 | See next page for caption. 
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Extended Data Fig. 8 | Co-clustering of primary and organoid single-cell 
datasets with CCA, scAlign, LIGER and MetaNeighbour. a, Canonical 
correlation analysis from Seurat v3 was performed using reference-based 
integration. For this analysis, 20,000 cells were randomly subsetted from both 
the primary and organoid datasets and their counts matrices were merged. The 
primary samples were designated as the reference, and using CCA the organoid 
cells were projected into that reference space. AUMAP plot of the intersection 
is shown. The stacked histogram shows the relative contributions of each 
sample to each cluster. Most clusters were primarily one dataset or the other, 
validating the observations of limited primary subtype recapitulation in 
organoids. b, For the clusters with at least 20% contribution from both primary 
and organoid cells, differential expression was performed across all of these 
clusters jointly using a two-sided Wilcoxon rank-sum test. The full differential 
expression is presented in Supplementary Table 5, but genes upregulated in 
organoid cells were examined with Enrichr pathway analysis, and asummary of 
the top Gene Ontology terms are presented (organoid: n=20,000 cells from 37 
organoids across 4 independent experiments; primary: n=20,000 cells from5 
individuals across 5 independent experiments). c, Canonical correlation 
analysis from Seurat v3 was performed using the integration-based method. 
For this analysis, 20,000 cells were randomly subsetted from both the primary 
and organoid datasets and their counts matrices were merged. AUMAP plot of 
the intersection is shown. The stacked histogram shows the relative 
contributions of each sample to each cluster. Most clusters were primarily one 
dataset or the other, validating the observations of limited primary subtype 
recapitulation in organoids. d, For the clusters with at least 20% contribution 
from both primary and organoid cells, differential expression was performed 
across all of these clusters jointly using a two-sided Wilcoxon rank-sum test. 
The full differential expression is presented in Supplementary Table 5, but 
genes upregulated in organoid cells were examined with Enrichr pathway 
analysis, and asummary of the top Gene Ontology terms is presented 
(organoid: n=20,000 cells from 37 organoids across 4 independent 
experiments; primary: n=20,000 cells from 5 individuals across 5 independent 
experiments). e, scAlign was performed for integration of datasets. For this 
analysis, 20,000 cells were randomly subsetted from both the primary and 
organoid datasets and their counts matrices were merged. AUMAP plot of the 
intersection is shown. The stacked histogram shows the relative contributions 
of each sample to each cluster. Many clusters were primarily one dataset or the 
other, validating the observations of limited primary subtype recapitulationin 
organoids. f, For the clusters with at least 20% contribution from both primary 
and organoid cells, differential expression was performed across all of these 
clusters jointly using a two-sided Wilcoxon rank-sum test. The full differential 
expression is presented in Supplementary Table 5, but genes upregulated in 
organoid cells were examined with Enrichr pathway analysis, and asummary of 
the top Gene Ontology terms is presented (organoid: n=20,000 cells from 


37 organoids across 4 independent experiments; primary:n=20,000 cells 
from 5 individuals across 5 independent experiments). g, LIGER was performed 
for integration of datasets. For this analysis, 20,000 cells were randomly 
subsetted from both the primary and organoid datasets and their counts 
matrices were merged. AUMAP plot of the intersection is shown. The stacked 
histogram shows the relative contributions of each sample to each cluster. 
Although the clusters were well mixed, they had very diffuse marker gene 
expression suggesting key that biological drivers of variation were obscured by 
the analysis. h, For the clusters with at least 20% contribution from both 
primary and organoid cells, differential expression was performed across all of 
these clusters jointly using atwo-sided Wilcoxon rank-sum test. The full 
differential expression is presented in Supplementary Table 5, but genes 
upregulated in organoid cells were examined with Enrichr pathway analysis, 
and asummary of the top Gene Ontology terms is presented (organoid: 
n=20,000 cells from 37 organoids across 4 independent experiments; 
primary: n=20,000 cells from 5 individuals across 5 independent 
experiments). i, MetaNeighbour was performed using unsupervised analysis to 
compare the clusters from primary and organoid samples. MetaNeighbour 
uses cell-cell similarity scores based upon neighbour voting and AUROC 
calculations to quantify the similarities between cells. These pairwise values 
were used as an input to hierarchical clustering, and almost entirely segregated 
primary clusters from organoid clusters. Box-and-whiskers plot shows 
quantification of the similarities within organoid and primary datasets versus 
the comparison of the two showed that the primary alone comparisons were 
significantly higher (organoid to organoid:***P= 0.00078; primary to 
organoid:***P= 0.00036, two-sided Welch’s t-test) (organoid:n=20,000 cells 
from 37 organoids across 4 independent experiments; primary: n=20,000 
cells from 5 individuals across 5 independent experiments). The bars show 
range of subtype correlation with middle line indicating the mean and error 
bars the maximum and minimum. These results further validate our 
observations that there are important distinctions between the organoid and 
primary subtypes.j, The gene score for each of the four integration methods is 
presented, and all are significantly lower than for primary clustering alone 
(organoid subtype: ****P=5.3 x10 °°; CCA v3 Projected: ****P=5.5x10 *;CCA 
v3 Integrated: ****P = 2.8 x 10*; scAlign: ****P= 2.110; LIGER: 

***P=72 9 x 10°, two-sided Welch’s t-test). The one method that substantially 
integrated the samples (LIGER) had the lowest gene score. Box-and-whisker 
plot shows mean score and error bars show maximum and minimum 
(n=242,349 cells from 37 organoids across 4 independent experiments). The 
differentially expressed genes that were upregulated in primary samples from 
all four analyses are intersected. A substantial number of these genes were 
found in all four datasets, and these genes included examples that we identified 
from other methods in this study, including PTPRZ1, MEF2C and SATB2, 
validating the accuracy of our analytical methods and our main findings. 
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Extended Data Fig. 9 | Comparing cell-type specificationin primary and 
organoid samples. a, Variance partition was run on both primary and organoid 
datasets across the metadata properties shown. Each dot represents a gene 
and the amount of variance of that gene explained by the relevant metadata 
property. b, ChEA analysis of type genes identified in primary cortical samples. 
The x-axis shows the -log,,.(adjusted P) of the transcription factors indicated; 
results obtained from Enrichr datasets included a variety of experimental 
systems but have been shortened for ease of reading to the relevant 
transcription factor (n=189,409 cells from 5 biologically independent 
samples; two-sided Wilcoxon rank-sum test). Type genes in organoid samples 
were not unified for significant transcription factor regulation. c, Violin plots 
of radial glia and neuron markers in primary (orange) and organoid (blue) radial 


Excitatory Neuron Ra 


dial Glia 


Radial Glia Mean Expression 


glia and neurons in which width of coloured section indicates distribution of 
expression of each data point within a sample. Insome cases, organoids show 
expression of multiple markers, lower expression of key markers, or similar 
expression to that seenin primary samples (organoids: n=242,349 cells from 
37 organoids across 4 independent experiments; primary: n=189,409 cells 
from 5 biologically independent samples from 5 independent experiments). 
d, Dot plots from Fig. 2 shown with one colour only to avoid dot overlap. e, 
Lower-magnification images of PTPRZ1and HOPX overlap as shown in Fig. 2c 
show domains of overlapping expression inthe primary oSVZ and distinct 
domains of expression in the organoid ventricular zone. Validation stains were 
repeated independently three times. 
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Extended Data Fig. 10 | Molecular maturation analysis. a, WGCNA networks 
generated from annotated primary radial glia (Methods) were applied to both 
primary and organoid radial glia cells. Module eigengenes shown inthe 

heat map indicate overall higher expression in primary compared to organoid 
radial glia. b, Pseudoage (x-axis) versus actual age (y-axis) in PFC and V1 radial 
glia showing that PFC neurons are more mature than V1 radial glia. c, Box-and- 
whisker plot (minimum to maximum, bar at mean) across all cells withina single 
organoid from all organoids within this study show heterogeneity of 
maturation is within a single organoid and not between individuals (n =242,349 


cells from 37 organoids across 4 independent experiments). d, The parallel 
pseudoage analysis to the analysis in Fig. 3c is shown, but starting with 
organoid networks for the pseudoage calculation. Graph line shows mean 
pseduoage score against actual age, shading represents the geometric density 
standard error of the regression. The same pattern is observable, with 
organoids failing to recapitulate the molecular maturation of primary radial 
glia, though genes related to the switch from neurogenesis to gliogenesis are 
preserved and may account for some of the limited correlation. 
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Extended Data Fig. 11| Areal identification. a, Organoid areal assignments by 
age, line and protocol indicate heterogeneous areal identity. b, Heat maps 
showing normalized module eigengene signature of each areain primary 
samples (with known area onthe right) and in organoid samples. c, Summary of 
assigned area in primary samples compared to actual area. In many cases, they 
correspond strongly, and in others there is evidence of lack of distinction. For 
example, parietal cells still strongly express temporal signatures, suggesting 
that they have not yet been distinctly specified in primary samples, although 
this specification does exist in organoids. d, Box-and-whisker plot (minimum to 


maximum, bar at mean, error bars shows.d.) is the same comparison as shown 
in Fig. 4c, but across all areas (primary: n=122,958 excitatory neurons from 5 
individuals from 5 independent experiments; organoids: n= 97,531 excitatory 
neurons from 37 organoids from 4 biologically independent stem cell lines. In 
some cases thereis no significant difference between strength of area signal in 
primary cells and organoid cells (PFC, NS (not significant), P= 0.5373), in other 
cases either the primary or organoid sample is significantly stronger (motor: 
*P=0.0148; all other areas: ****P< 0.0001; Welch’s two-sided t-test). 
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Extended Data Fig. 12 | See next page for caption. 


Extended Data Fig. 12 | Glycolysis and ER stress across culture systems. a, 
Markers of metabolic stress are expressed across cortical organoid protocols. 
Violin plots show both data from our experiments (1-3) and published datasets 
from other protocols (4-12), which have significantly increased expression of 
the glycolysis gene PGK1 and the ER stress genes ARCN1 and GORASP2 
compared to primary samples (n=5 individual replicates, GW14 shown). Width 
of the colored area indicates mean gene-expression level of each dataset and 
overlaid dots show each individual data point. All protocols have significantly 
higher expression of these three markers compared to primary samples 
(****P=< 0.0001, two-sided Student’s f-test). b, Single-cell sequencing 
identified increased expression of genes in organoids, which was validated 


across all stages of organoid differentiation evaluated (weeks 3-14). 

Validation staining experiments were repeated independently three times. 
Representative images from week-14 organoids differentiated using the least 
directed differentiation protocol. Colonies of induced PSCs also express the ER 
stress markers ARCN1 and GORASP2 (n= 3 biologically independent samples 
across 3 experiments). Scale bar, 50 um. c, Primary cortical tissue express 
glycolysis and ER stress genes at undetectable levels (n= 3 biologically 
independent samples across 3 experiments). When tissue was cultured for one 
week, there was no significant increase in cellular stress (n =3 biologically 
independent samples across three experiments). Scale bar, 50 pm. 
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Extended Data Fig. 13 | See next page for caption. 


Extended Data Fig. 13 | Glycolysis and ER stress across experimental 
conditions. a, Metabolic stress network module eigengene expression across 
all cells is shown in box-and-whisker plots (minimum to maximum, bar at 
average, error bars shows.d.) across 11 datasets generated either inthis 
manuscript or from publicly available datasets. Data are shown for expressed 
genes from KEGG pathway glycolysis and ER stress networks. This study: 
n=242,349 cells from 37 organoids across 4 independent experiments; 
published datasets as annotated. b, The same box-and-whisker plots are shown 
for organoids (n= week 3: 38,417 cells, week 5: 26,787 cells, week 8: 11,023 cells, 
week 10:50,550 cells, week 15: 2,722 cells, week 24: 4,506 cells from4 
independent experiments) and all primary ages (n = GW6: 5,970 cells, GW10: 
7,194 cells, GW14: 14,435 cells, GW18: 78,157 cells, GW22: 83,653 cells from5 
independent experiments). ER stress and glycolysis networks decrease over 
time in primary samples but decrease less in organoids and are significantly 
higher in most organoid stages than in primary samples. Significance was 
calculated for each organoid sample with respect to each primary sample, and 
aone-sided Welch’s t-test was performed (to evaluate whether organoid 
expression was higher than primary). All comparisons were either not 
significant (ns) or significant with ****P< 0.0001.c, Cellular stress genes are 


expressed at low levels during human cortical development. GW13 and 17 
samples were stained for the glycolysis gene, PGK1, and showed little 
expression at either age. The ER stress gene ARCNI had little expression at 
either age, but there was modest expression of the ER stress gene GORASP2 at 
GWI13 that decreased by later neurogenesis. Staining validation studies were 
performed independently four times. d, Dissociated primary cells were 
cultured for one week. Across five independent studies, there was no 
detectable expression of the glycolysis gene PGK, but the ER stress genes 
ARCNI1 and GORASP2 showed significantly increased expression. e, 
Immunostaining of primary aggregates (n=5 biologically independent 
samples), which express markers of oRG cells (HOPX and SOX2), IPCs (TBR2) 
and neurons (CTIP2). Aggregates also had increased cellular stress indicated by 
PGK1, ARCN1and GORASP2 staining. Violin plots show expression level and 
data distribution for each marker in primary cells, primary cells after organoid 
transplantation and primary cells after being aggregated together. The 
expression of PGK1and GORASP2are increased in post-transplanted primary 
cells from the organoid as wellas in primary cell aggregates. Cell types and 
physical distribution inthe primary aggregate are shown. Scale bar, 50 mm. 
Representative image shown (n=3 replicates). 
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Extended Data Fig. 14 | See next page for caption. 


Extended Data Fig. 14 | Organoid transplantation at multiple time points. 

a, FACS plots showing dummy infection (left) and transplanted organoids 
(right) in terms of the their GFP signal (x-axis) versus sidesscatter (y-axis). Cells 
inthe gated region were collected (% of parent written on plot) and sequenced 
for transplantation 2.5 weeks after incubating in the organoid, representative 
plot shown onright, n=5.b, Immunohistochemical validation that cells 
infected with GFP virus were all SOX2-labelled progenitor in cells dissociated 
from primary cortical tissue GW14-20. Scale bar, 50 mm, representative image 
shown (n=5 replicates).c, Anadditional example of primary cell integration 
into organoids after transplant, in which the primary cells integrate into 
organoid rosettes (n=7 primary samples into 21 organoids across 2 
independent studies). d, t-SNE of pre- and post-transplant primary cells, as well 
as the cluster designations. Many cell types represented in pre-transplanted 
cells are not present in the post-transplant population. e, Subtype similarity 
correlation between pre-transplant, post-transplant, and primary aggregate 
samples. Includes plot (bar is average subtype correlation, error bars ares.e.) 
as areplicate of the experiment in Fig. 5b, validating that at older organoid ages 
(week 12) the post-transplanted cells are still significantly impaired in their 
subtype specification (****P=1.46 x10, n=2 primary biologically 


independent samples into 2 organoids in addition ton=5 biologically 
independent samples into 10 organoids in Fig. 5, two-sided Welch’s t-test). 
Primary aggregates are significantly impaired in their subtype specification 
(**P= 0.0016), but are significantly better than post-transplanted primary cells 
(**P= 0.0037). This may be related to non-neural populations in the aggregates. 
f, Transplanted organoid cells were visualized in the mouse cortex after 2and5 
weeks post-transplant (n =13 independent mice transplanted with 14 organoids 
derived from 2 induced PSC lines across 2 independent experiments). Human 
cells were visualized by GFP and human nuclear antigen (HNA) expression. 
Organoid-derived cells expressed markers of progenitors (SOX2 and PAX6), 
neurons (CTIP2, SATB2 and NEUN) and astrocytes (GFAP and HOPX). Mouse- 
derived vascular cells (laminin and CD31) innervate the organoid transplant. 

g, After 2 weeks post-transplantation, organoid cells showed reduced 
expression of the glycolysis gene PGK1 and ERstress genes ARCNI1 and GORASP2 
(n= 6 transplanted mice stained with each marker independently from 2 
induced PSC lines across 2 independent experiments). h, Subtype correlation 
analysis of pre- and post- transplanted organoid cells shows an increase inoRG 
subtype identity (similarity to primary cluster 26) and in newborn neurons 
(similarity to primary cluster 22). 
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for data analysis 
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Single-cell RNA sequencing data has been deposited in dbGAP for accession “A Cellular Resolution Census of the Developing Human Brain” and in GSE132672. An 
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retrieved directly from the single-cell data available in public repositories or from the UCSC cell browser website. 


> 
jad) 
= 
e 
= 
a 
= 
o 
Za) 
© 
fev) 
= 
a 
=r 
= 
o 
i) 
Oo 
a 
5 
a 
a) 
(= 
5 
S 
fev) 
5 
< 


6) 


D 


8L0E 40qG0} 


Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


[x | Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Sample sizes were determined by number of biological replicates of distinct individuals and were chosen to be a minimum of three, but 
potentially more were used. No statistical methods were used to determine sample size. 


> 
jad) 
= 
‘= 
= 
a 
= 
o 
Za) 
© 
fev) 
= 
a 
=r 
= 
@ 
i) 
S) 
a 
= 
a 
a) 
= 
5 
=: 
fev) 
5 
S 


Data exclusions o data were excluded 


Replication Reproducibility was ensured by repeating experiments multiple times, including doing experiments on different days, even if distinct indivuals 
were being assayed. All replication attempts were successful. 


Randomization Randomization was performed computationally for subsets of analysis, including WGCNA and variancePartition. 


Blinding o investigators were blinded in this study and blinding was not relevant because we used quantitative methods to measure our results, 
especially with regards to single-cell analysis 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
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n/a | Involved in the study n/a | Involved in the study 
[x] Antibodies x ChIP-seq 
[x Eukaryotic cell lines x | Flow cytometry 
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[x Animals and other organisms 
[x Human research participants 
x Clinical data 
Antibodies 
Antibodies used Sox2 (Santa Cruz, 1:500, sc-365823), Hopx (Santa Cruz, 1:250, sc-398703), Satb2 (Abcam, 1:250, ab51502), AUTS2 (abcam, 1:100, 
ab243036), Human Nuclei (Millipore, 1:500, MAB1281), Rabbit: Hopx (Proteintech, 1:500, 11419-1-AP), GORASP2 (Proteintech, 
1:50, 10598-1-AP), ARCN1 (Proteintech, 1:50, 23843-1-AP), PGK1 (Thermo Fisher 1:50, PA5-13863), PTPRZ1 (Atlas, 1:250, 
HPAO15103), NR2F1 (Novus, 1:100, NBP1-31259), Rat: Ctip2 (Abcam, 1:500, ab18465), Sheep: Eomes (R&D, 1:200, AF6166), 
Guinea pig: NeuN (Millipore, 1:500, ABN90), Chicken: Gfp (Aves, 1:500, GFP-1020). 
Validation Only previously published antibodies, or antibodies with company based validation of correct size were used. Website of each 


company can be consulted. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) hESC lines: H1 - obtained from WiCell. iPSC lines: 1323_4, WTC10 from the Gladstone Institute, H28126 from University of oO 
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Authentication Lines were obtained using MTA approval; at time of generation lines were karyotyped for normal identity. S 
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Commonly misidentified lines No ICLAC lines were used 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Postnatal day four (p4) male and female NSG (NOD.Cg-Prkdcscid II2rgtm1Wjl/SzJ, stock No:005557) mice were used for this 
study. 

Wild animals The study did not involve wild animals. 

Field-collected samples The study did not involve field-collected samples. 

Ethics oversight Mouse experiments were approved by UCSF Institutional Animal Care and Use Committee (IAUCUC) protocol AN178775-01. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Human tissue samples were collected without any identifying information including sex or race. 

Recruitment No human participants were used in this study; human tissue was collected from elective terminations with the patients prior 
consent. 

Ethics oversight All primary tissue was obtained and processed as approved by the UCSF Human Gamete, Embryo and Stem Cell Research 


Committee (GESCR) approval 10-05113 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Oninfection of their host, temperate viruses that infect bacteria (bacteriophages; 


hereafter referred to as phages) enter either alytic or alysogenic cycle. The former 
results in lysis of bacterial cells and phage release (resulting in horizontal 
transmission), whereas lysogeny is characterized by the integration of the phage into 
the host genome, and dormancy (resulting in vertical transmission)". Previous co- 
culture experiments using bacteria and mutants of temperate phages that are locked 
inthe lytic cycle have shown that CRISPR-Cas systems can efficiently eliminate the 
invading phages”. Here we show that, when challenged with wild-type temperate 
phages (which can become lysogenic), type 1 CRISPR-Cas immune systems cannot 
eliminate the phages from the bacterial population. Furthermore, our data suggest 
that, in this context, CRISPR-Cas immune systems are maladaptive to the host, owing 
to the severe immunopathological effects that are brought about by imperfect 
matching of spacers to the integrated phage sequences (prophages). These fitness 
costs drive the loss of CRISPR-Cas from bacterial populations, unless the phage 
carries anti-CRISPR (acr) genes that suppress the immune system of the host. Using 
bioinformatics, we show that this imperfect targeting is likely to occur frequently in 
nature. These findings help to explain the patchy distribution of CRISPR-Cas immune 
systems within and between bacterial species, and highlight the strong selective 
benefits of phage-encoded acr genes for both the phage and the host under these 


circumstances. 


CRISPR-Cas adaptive immune systems provide sequence-specific 
resistance against phage infections by inserting phage-derived 
sequences (spacers) of around 30 bp into CRISPR loci on the host 
genome’. Upon reinfection, CRISPR transcripts guide Cas proteins to 
destroy the matching target®. The uptake of new spacers is far more effi- 
cient ifa pre-existing spacer has partial complementarity to the phage. 
This process (known as ‘priming’) is widespread in type | CRISPR-Cas 
systems, and provides protection against phage mutants that over- 
come host resistance by point mutation of their target sites®’. How- 
ever, partially matching spacers can also cause immunopathological 
effects when temperate phages enter the lysogenic lifecycle, during 
which the phage genome is integrated into that of the host and exists 
ina dormant state until it is induced. Lysogens (that is, bacterial cells 
that carry at least one prophage) are found across bacterial genera; 
however, their frequencies vary®. During lysogeny, a primed CRISPR- 
Cas immune system may target the partially complementary site in 
the prophage, causing damage to both the phage and host DNA and 
resulting in the induction of the SOS response’. However, whether 
and how these potentially negative effects influence the evolution- 
ary and population dynamics of CRISPR-phage interactions remains 
unclear, because these processes have previously been studied only 
in the context of virulent phages?°. 


Temperate phages persist despite CRISPR 


To explore CRISPR-phage interactions in the context of lysogeny, we 
infected Pseudomonas aeruginosa PA14 with the temperate phage DMS3 
or the virulent mutant DMS3vir (a DMS3 mutant that is locked in the 
lytic cycle through mutation of the c-repressor gene) and monitored 
bacterial and phage population dynamics for eight days. P. aeruginosa 
strain PA14 carries a type I-F CRISPR-Cas immune system, in which 
spacer 1 of CRISPR array 2 has an imperfect match (5 mismatches) to 
gene 42 of DMS3””°. As previously reported’, DMS3vir was driven extinct 
by wild-type bacteria at five days post-infection owing to the evolu- 
tion of CRISPR-based resistance, whereas a Acas7 mutant of the PA14 
strain (hereafter Acas7—which lacks a functional CRISPR-Cas system— 
evolved surface-based resistance that allowed for phage persistence 
(Fig. la—c). By contrast, both wild-type and Acas7 bacteria were unable 
to clear wild-type DMS3 infections (Fig. 1b), which suggests that the 
ability to transmit vertically (during lysogeny) is acritical determinant 
of the survival of temperate phages when bacteria encode CRISPR-Cas 
immune systems. To understand the evolutionary drivers of these 
population dynamics, we isolated bacterial clones from the DMS3- 
infected cultures at three days post-infection and quantified the pro- 
portion of lysogens and bacteria with CRISPR-based and surface-based 
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Fig. 1| Phage persistence and evolution of resistance in the host upon 
infection with virulent or temperate phages. a, b, Phage densities over time 
after infection of wild-type (WT) PA14 or the Acas7 mutant with DMS3vir (a) or 
DMS3 (b). The limit of phage detection is 200 plaque-forming units per 
millilitre (PFU ml”). c, d, Fraction of bacteria that evolved resistance at 

three days post-infection (dpi) after infection with the phages DMS3vir (c) or 
DMS3 (d), either through CRISPR-Cas (CRISPR), surface modification (sm) or 
lysogeny (lysogen). Fractions are based on 24 randomclones per replicate 
experiment. In all panels, data are the mean of six biologically independent 
replicates per treatment. Error bars represent 95% confidence intervals. 


resistance. This showed that the evolution of CRISPR-based resistance 
was substantially reduced in wild-type populations of bacteria exposed 
to DMS3 compared to those exposed to DMS3vir (Fig. Ic, d); instead, 
many wild-type bacteria carried the DMS3 prophage (Fig. 1d), which 
confers phage resistance through superinfection exclusion. However, 
lysogeny levels were reduced in wild-type hosts compared to in the 
Acas7 strain, which almost invariably carried the prophage (Fig. 1d). 

The difference in DMS3 and DMS3vir persistence could be due to 
differences in the evolution of resistance in the host or to the ability 
of DMS3 to transmit vertically while continuously releasing free phage 
particles through prophage induction. To distinguish between these 
possibilities, we infected wild-type and Acas7 strains with an equal mix 
of DMS3 and DMS2vir, and observed bacterial and phage population 
dynamics and resistance evolution similar to those that were observed 
during infection with the temperate phage alone (Extended Data Fig. 1). 
Next, we examined howthe relative frequencies of DMS3 and DMS3vir 
changed over time (Fig. 2a, b). Initially, DMS3vir outcompetes DMS3, 
consistent with the idea that high densities of sensitive hosts favour 
horizontal transmission (the lytic replication cycle)". However, at later 
time points, all free phage particles belong to the DMS3 phage geno- 
type. As the host population that was experienced by the two phages 
was identical, this demonstrates that vertical transmission facilitates 
the observed persistence of DMS3. The persistence of free phages 
was facilitated despite the low frequencies of lysogeny in wild-type 
compared to Acas7 bacteria (compare the blue and red lines in Fig. 2a, 
b, Extended Data Fig. Ic, d). 

To understand why lysogen formation was depressed in wild-type 
bacteria compared to Acas7 bacteria, we performed temporal sam- 
pling of DMS3-infected populations over seven days. This revealed 
that lysogeny in wild-type bacteria was already depressed at one 
day post-infection and continued to decline until seven days post- 
infection, whereas the proportion of lysogens in isogenic mutants with 
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Fig. 2| The effect of CRISPR adaptation and interference onlysogeny and 
phage persistence. a, b, Relative frequencies of DMS3 over time after infection 
of wild-type PA14 (a) or the Acas7-mutant (b) host with an equal mix of DMS3 
and DMS3vir. Relative frequencies are shown both for the free and total (that is, 
including lysogens) phage population. c, Percentage of DMS3 lysogens inthe 
host population at 1, 3 or 7 days post-infection of the wild-type PA14 strain, the 
isogenic CRISPR-interference deficient mutants Acas7 and Acas3, or the 
isogenic CRISPR-adaptation-deficient mutant Acas]J, based on 24 random 
clones per replicate experiment per time point. d, e, DMS3 phage (d) and 
bacterial densities (e) during this co-culture experiment. CFU, colony-forming 
unit. In all panels, data are the mean of six biologically independent replicates 
per treatment. Error bars represent 95% confidence intervals. 


a defective CRISPR-interference pathway (the Acas3 and Acas7 mutants 
of PA14) was high and constant during this same period (Fig. 2c). Cru- 
cially, the proportion of DMS3 lysogens at one day post-infection was 
also depressed in a AcasI mutant of PA14, which is unable to acquire 
novel spacers (adaptation) but is proficient in detecting and destroy- 
ing complementary DNA (interference). However, inthis background, 
the proportion of lysogens increased between one and seven days 
post-infection to levels similar to those in the CRISPR-interference 
mutants (Fig. 2c). These data therefore suggest that wild-type and Acas1 
bacteria are partially resistant to DMS3 infection (even in the absence 
of spacer acquisition), which results initially in fewer lysogens and 
further reductions ifthe hosts carry the genetic machinery to acquire 
additional spacers. 


Mismatched spacers are maladaptive 

We hypothesized that the observed suppression of lysogeny in wild- 
type and Acas1 backgrounds was dependent on the imperfect match 
(5 mismatches) between gene 42 of DMS3 and spacer 1 of CRISPR array 2 
(refs. *"°). To test this, we infected a strain that lacks CRISPR array 2 
(ACRISPR2) with DMS3 and observed that the plasmid-based expres- 
sion of spacer 1 (ACRISPR2-sp1) led to population dynamics similar to 
those observed during infection of the wild-type strain (Extended Data 
Fig. 2a, b) and suppression of lysogeny (Extended Data Fig. 2c), whereas 
expression of a control non-targeting spacer (ACRISPR2-NT) did not. 


To further corroborate the hypothesis that the suppression of lysogeny 
was caused by partial CRISPR-Cas resistance, we performed infections 
with phages that carry acr genes, which are widespread and diverse 
genes (currently classified into 47 families) that are encoded by arange 
of mobile genetic elements and that block CRISPR interference”. As 
expected, the infection of wild-type bacteria with a mutant DMS3 
phage that carries acrIF1 or acrIF4 (both of which block the type I-F 
CRISPR-Cas system of PA14"’) showed levels of lysogeny similar to those 
observed for interference-deficient mutants (Extended Data Fig. 3a). 

Although lysogen formation was reduced in wild-type and Acas1 
bacteria, the concentration of free phage was actually higher during 
the early stages of DMS3 infection in these hosts (Fig. 2d). This effect 
disappeared by seven days post-infection if bacteria were unable to 
acquire new spacers (the Acas1 strain), or inverted at four days post- 
infection ifthe bacteria had a functional CRISPR-Cas system (Fig. 2d). 
This also coincided with reduced bacterial densities of wild-type and 
Acas1 populations compared to CRISPR-interference mutants during 
the early stages of the phage epidemic (Fig. 2e), which suggests that 
functional CRISPR-Cas immune systems may be maladaptive in this 
context. We speculated that the high DMS3 titres during early stages of 
infection of wild-type and Acas1 bacteria (Figs. 1b, 2d), and the concur- 
rent reduced bacterial densities, could be due to interactions between 
the CRISPR-interference machinery and the partially complementary 
prophage (autoimmunity), which can activate an SOS response’ and 
which—in turn—may trigger prophage induction”. To test this, we 
isolated six independent DMS3 lysogens from wild-type, Acas7 and 
Acas1 backgroundsat one day post-infection. Growth measurements 
revealed considerable fitness costs of lysogeny in wild-type and Acas1 
backgrounds, whereas the growth of Acas7 lysogens was unaffected 
(Fig. 3a—c). The growth of lysogens that lack spacers 1 and 2 of CRISPR2 
(Asp1-2) or lack the entire CRISPR 2 array (ACRISPR2-NT) was com- 
parable to that of Acas7 lysogens (Fig. 3d, Extended Data Fig. 2c), but 
restoring the expression of CRISPR 2 spacer 1 (ACRISPR2-sp1) resulted 
in areduced growth rate (Extended Data Fig. 2d), which confirms that 
this spacer is responsible for the reduced fitness of wild-type bacteria 
that carry the prophage. Competition of lysogens against bacteria 
with surface-based resistance confirmed a fitness cost of lysogeny in 
the wild-type—but not in the Acas7—background (relative fitness <1, 
one-tailed Wilcoxon signed-rank and t-tests; wild type, degrees of free- 
dom =5, P=0.016; Acas7, t;= 3.6551, P= 0.99) (Fig. 3e, f). This effect 
was reduced if the prophage encoded an Acr (one-tailed t-test; acr/F1, 
t;= 16.562; acrIF4, t;=14.805 and P< 0.0001 in both cases) (Fig. 3e), 
whereas acr genes had no effect on fitness of Acas7 lysogens (analysis 
of variance (ANOVA) F,,,; = 0.205, P= 0.82) (Fig. 3f). 

Next, we estimated prophage induction levels in wild-type and Acas7 
lysogens. To this end, we pelleted bacterial cultures and washed away 
free phages, and then resuspended in fresh medium, and monitored 
bacterial densities and free phage accumulation over 25 h. Although 
initial cell densities were similar for all strains, the growth of wild-type 
cultures plateaued at a lower density than that of other backgrounds 
(Extended Data Fig. 3b), but wild-type cultures accumulated more 
phage particles compared to Acas7 lysogens (Extended Data Fig. 3c)— 
unless the prophage carried an acr gene. Taken together, these data sup- 
port the hypothesis that (unless DMS3 encodes acr genes) CRISPR-Cas 
causes immunopathology during vertical transmission of the phage, 
which triggers prophage induction and therefore explains the high 
free-phage titres during the early infection stage. 

These immunopathological effects may select for CRISPR mecha- 
nisms that selectively target phages during their lytic cycle, as has pre- 
viously been described for type Ill systems*?°, However, competition 
between DMS3 and DMS3vir phages showed that the main determinant 
of their relative fitness is the proportion of sensitive bacteria in the 
population” (two-way ANOVA, F,,;=3.1683, P= 0.008) (Extended Data 
Fig. 3d) and is independent of whether resistance of bacteria in the 
population was CRISPR-based or surface-based (F,,,,= 2.8923, P=0.09) 
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Fig. 3| Fitness of lysogens with an active CRISPR-Cas system is reduced 
unless they encode acr genes. a, Twenty-four-hour growth curves of 
uninfected control cultures, or six independent DMS3 lysogens in wild-type 
PA14 (a), Acas1 (CRISPR-adaptation-deficient) (b) and Acas7 (CRISPR- 
interference-deficient) (c) genetic backgrounds. Lysogens were isolated from 
day 1 of the co-culture experiment shown in Fig. 2. Curves are the mean of six 

(a, b) or four (c) replicates and shaded areas represent s.e.m. OD¢oonm, Optical 
density at600 nm. d, Twenty-four-hour mean growth curves of six lysogens in 
wild-type, Acas7and Asp1-2 (carrying a deletion of CRISPR2 spacers land 2) 
backgrounds isolated from six biological replicates. Each growth curve was 
performed in five technical replicates. Shaded areas represent s.e.m. 

e, f, Fitness relative to a mutant of PA14 with a surface modification (sm) of wild- 
type PA14 lysogens isolated one day post-infection with DMS3 (e) or PA14 Acas7 
lysogens isolated one day post-infection with DMS3 (f). Relative fitness was 
determined after one day of competition. Each point represents the average 
relative fitness of one independent lysogen clone measured across six 
biologically independent experiments. Error bars indicate 95% confidence 
intervals. 


(Extended Data Fig. 3d). This suggests that the type 1 CRISPR-Cas sys- 
tem of P. aeruginosa lacks the ability to distinguish between phages 
that enter lytic or lysogenic cycles. 

Given these opposing fitness effects of CRISPR-Cas immune systems 
during horizontal”? and vertical transmission of DMS3 (Fig. 3), it is 
unclear what the net fitness effects of CRISPR-Cas systems are during 
temperate-phage infections. To explore this, we competed wild-type 
and Acas7 strains in the presence of DMS3vir, DMS3, DMS3 acr/F1 or 
DMS3 acr/F4 phages and determined their relative fitness at days 1,3 
and 7. Notably, although wild-type bacteria were fitter than Acas7in 
the presence of the virulent phage (one-tailed Wilcoxon signed-rank 
test, relative fitness >1, P=0.016) (Fig. 4a), they were considerably less 
fit during temperate-phage infection (relative fitness < 1, P= 0.018) 
(Fig. 4a), unless the temperate phage carried acr genes (relative fit- 
ness #1, P=0.16) (Fig. 4a). Therefore, in this context, acr genes not only 
provide a benefit to the phage during both horizontal*””"* and vertical 
transmission of the phage (one-tailed t-test, relative fitness #1; acr/F1, 
t;=39.30, P< 0.0001; acriF4, t;=11.61, P< 0.0001) (Fig. 4b) but also to 
the host, by preventing autoimmunity. 
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Fig. 4 | Lysogens evolve to mitigate fitness costs. a, Relative fitness of wild- 
type PA14 during competition with the Acas7 mutant strain following infection 
with 10* PFU of DMS3, the lytic mutant DMS3vir or the anti-CRISPR-encoding 
mutants DMS3 acr/F1 and DMS3 acrIF4.b, Relative fitness of DMS3 (free 
phages + lysogens) following three days of competition with DMS3 acr/F1 or 
DMS3 acrIF4 on either wild-type PA14 or Acas7 mutant. c, d, Twenty-four-hour 
growthcurves of uninfected control cultures, or six biologically independent 
DMS3 lysogens in the wild-type PA14 (c) or Acas1 (d) genetic backgrounds, 


Bacteria evolve to lose CRISPR-Cas 


We next monitored whether wild-type lysogens evolved to alleviate 
these autoimmunity costs through mutation of their immune system 
or the prophage. Lysogens in wild-type and adaptation-deficient Acas1 
backgrounds, which had reduced growth during early infection, were 
selected froma late stage of the infection (seven days after infection). 
Growth curves revealed that the negative fitness consequences of 
CRISPR-Cas immune systems had disappeared in these ‘late’ lysogens, 
and that these bacteria now had growth rates comparable those of 
ancestral hosts that lack the prophage (Fig. 4c, d; compare to Fig. 3a,b). 
Acompetition experiment confirmed this finding, which shows that late 
lysogens in a wild-type background were fitter than a surface mutant 
(one-tailed t-test, relative fitness >1, t,=5.985, P< 0.001) (Fig. 4e), and 
the presence of acr genes in the prophage no longer increased the fit- 
ness of the host (one-tailed t-test, acr/F1, t;=-5.5085, P=1.00; acriF4, 
t;=—2.0395, P= 0.95) (Fig. 4e, compare to Fig. 3e). 

To understand the mechanistic basis for this alleviation in fitness 
costs, we performed PCR analyses of the CRISPR loci of six independent 
lysogens in wild-type, Acas1 and Acas7 backgrounds, isolated from early 
or late time points after infection. Both CRISPR loci (land 2) amplified 
as expected in early lysogens (Extended Data Fig. 4a); however, amplifi- 
cation failed in many late lysogens in wild-type and AcasI backgrounds 
(Extended Data Fig. 4a, negative PCR indicated by red frame), which 
suggests that the CRISPR-Cas locus was lost in these clones. Whole- 
genome sequencing of late lysogens revealed large genomic deletions 
(between about 50 and 230 kb), associated with prophage integra- 
tion, that encompassed the entire CRISPR-Cas locus—but contained 
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which were isolated from day 7 of the co-culture experiment shown in Fig. 2. 
Curves are the mean of six replicates and error bars represent s.e.m. e, Relative 
fitness of aDMS3 lysogen ina wild-type PA14 genetic background, isolated 
from day 5 ofaco-culture experiment, during competition witha surface 
mutant. Relative fitness was calculated after one day of competition. All panels 
show the mean of six biologically independent replicates per treatment. Error 
bars represent 95% confidence intervals (a, b, e) or s.e.m. (c,d). 


no essential gene’’—in wild-type and Acas1 backgrounds, whereas 
the genome remained intact in Acas7 late lysogens (Extended Data 
Fig. 4c—e, Extended Data Table 1). This loss of the CRISPR-Cas locus 
was Clearly driven by the immunopathological effects of CRISPR 2 
spacer 1, as the locus was maintained in lysogens that lack CRISPR 2 
(ACRISPR2) but was lost when expression of spacer 1 was restored 
(ACRISPR2-sp1) (Extended Data Fig. 2b). The loss of CRISPR-Cas was 
also avoided when wild-type bacteria were lysogenized by DMS3 that 
carry acr genes (Extended Data Fig. 4a). 


Generality of the empirical data 


Collectively, these data show that temperate-phage infection can drive 
the rapid loss of CRISPR-Cas immune systems from bacterial genomes 
owing to immunopathological effects that manifest during vertical trans- 
mission. To generalize our findings beyond phage DMS3 (which belongs 
tothe Caudovirales), we introduced a priming spacer with one mismatch 
against the PfS prophage (which belongs tothe /noviridae) that is naturally 
present in the genome of wild-type PA14”°. Expression of a Pf5-priming 
spacer caused reduced growth of the wild-type PA14 strain and strong 
selection for bacteria that carry deletions in their CRISPR-Cas immune 
system (Extended Data Fig. 5). These observations therefore suggest that 
immunopathological effects generally drive selection for CRISPR loss, 
whenever hosts carry spacers primed against their prophage. Indeed, 
when we formalized these ideas ina theoretical framework, we recovered 
population and evolutionary dynamics very similar to those observed 
in our experiments (Extended Data Fig. 6; a detailed description of the 
model is provided in the Supplementary Information). 


Crucially, in nature, priming is thought to be frequent owing 
to the relaxed sequence-identity requirements for triggering 
this pathway: up to 13 mismatches between a pre-existing spacer 
and the target are tolerated’. Indeed, analysis of the spacers from 
more than 170,000 bacterial genomes and a dataset of about 
20,000 prophages” revealed that pre-existing spacers with perfect 
(no mismatches) or imperfect (1-5 mismatches) targets to temper- 
ate phages are common within genera (Extended Data Fig. 7; further 
information is inthe Supplementary Information). On average, 49% 
of all prophages were targeted by priming spacers carried by bacte- 
ria within the same genera as the lysogen (Extended Data Fig. 7c). If 
mismatched spacers generally cause residual DNA cleavage activity 
by the CRISPR immune system, they would be expected to cause 
immunopathological effects similar to those observed for the P. aer- 
uginosa-DMS3 interaction. Indeed, when all complete P. aeruginosa 
genomes were assessed, we observed a significant enrichment for 
an increase in the frequency of acr genes when self-targeting spac- 
ers matched prophages (Extended Data Fig. 8), which is consistent 
with the idea that these spacers cause immunopathological effects. 
Together, these bioinformatic analyses suggest that—in nature—the 
maladaptive effects of CRISPR-Cas against temperate phages are 
likely to be common. 

Innatural ecosystems, bacteria are frequently exposed to both tem- 
perate and virulent phages, and the latter may superinfect the lyso- 
gens. To further generalize our findings, we explored whether these 
superinfections would be likely to change the observed population and 
evolutionary dynamics. We found that a mixed infection of wild-type 
PA14 with the temperate phage DMS3 and the virulent phage LMA2 
(which can superinfect DMS3 lysogens) resulted in rates of lysogeniza- 
tion similar to those with the temperate phage alone (Extended Data 
Fig. 9d, e) and high fitness costs of CRISPR-Cas (Extended Data Fig. 9g), 
which drive the evolutionary loss of the immune systems (Extended 
Data Fig. 9f). Our theoretical model recovered similar dynamics, inde- 
pendent of whether CRISPR-based resistance evolves against only 
one (as in our experiment) or against both phages (Extended Data 
Fig. 9h-o). 


Discussion 


The observations that approximately 60% of all bacterial genotypes lack 
CRISPR-Cas systems, and that closely related strains differ in whether 
they encode cas genes, suggest that these systems are frequently gained 
and lost from bacterial genomes”. Although phage infection is typically 
assumed to be an important selective force for the maintenance of 
CRISPR-Cas immune systems, this work reveals their rapid loss when 
exposed to a temperate phage, as aresult of the immunopathological 
effects that occur when the CRISPR immune system is primed against 
this phage. Some CRISPR-Cas immune systems may mitigate some of 
these autoimmunity costs by limiting the spacer acquisition from, and 
cleavage of, transcriptionally silent elements’. However, CRISPR- 
based autoimmunity has frequently been reported”* ”, and the high 
frequencies at which primed self-targeting interactions occur suggests 
that this is probably a major driver of the evolutionary loss of CRISPR 
systems in nature. 
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Methods 


No statistical methods were used to predetermine sample size. The 
experiments were not randomized and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Bacterial strains and viruses 

P. aeruginosa UCBPP-PA14 (referred to as wild type) and P. aeruginosa 
UCBPP-PA14 csy3::lacZ (referred to as Acas7) were used in all experi- 
ments, and have previously been described”’. The surface mutant 
Tn::pilA (which lacks a pilus, and is referred to as sm2) has previously 
been described”’ and was used in competition experiments witha bacte- 
riophage insensitive mutant (BIM) with two additional acquired spacers 
over those present in the wild-type strain, as previously described’. 

Lysogens in wild-type or CRISPR-mutant backgrounds used in growth 
curves, fitness and induction experiments were generated in this 
study; the presence of a prophage was confirmed by PCR. PA14 strains 
Acas3, Acas1, ACRISPR2 and CRISPR2 Asp1-2 identified, respectively, 
as SMC4268, SMC4277 and SMC3895 and SMC4707, have previously 
been described’®”’. 

Phage amplifications were carried out on P. aeruginosa UCBPP-PA14 
csy3::lacZ.DMS3 and the obligate lytic variant DMS3vir have previously 
been described*°. DMS3 acr/F1 and DMS3 acrIF4 phages were generated 
inthis study following a method for hybrid phage construction detailed 
ina previous study”. DMS3vir acr/F1 was used in downstream analyses 
and has previously been described’. A virulent phage capable of infect- 
ing DMS3 lysogens (LMA2, which has previously been described”) was 
used in co-culture experiments. 

All bacterial strains were grown at 37 °C in LB broth or M9 medium 
(22 mM Na,HPO,; 22 mM KH,PO,; 8.6 mM NaCl; 20 mM NH, CI; 1 mM 
MgsO,; and 0.1mM CaCl,) supplemented with 0.2% glucose. When 
appropriate, medium was further supplemented with gentamycin (SO 
mg/ml) and arabinose (1% w/v). 


Evolution experiments 

To monitor the evolution of bacterial resistance in response to 
phage infection and the associated bacterial and phage popula- 
tion dynamics, microcosms with 6 ml of M9 medium supplemented 
with 0.2% glucose were inoculated with approximately 10° CFU bac- 
teria from fresh overnight cultures of the corresponding bacte- 
rial strains. These cultures were infected with 10* PFU of DMS3vir, 
DMS3, DMS3 acr/F or LMA2 phages, followed by incubation at 37 °C 
and shaking at 180 rpm. Cultures were transferred 1:100 into fresh 
medium every 24 h for 5-8 days. All experiments were performed 
in six independent replicates. 


Determination of resistance phenotypes 

Resistance phenotypes were determined at day 3 or day 7 by streaking 
individual colonies (24 randomly picked per replicate) through DMS3vir 
and anti-CRISPR phage DMS2vir acr/F1. Surface modification was con- 
firmed by colony morphology, broad-range resistance to DMS3vir 
and DMS32vir acrIF1, and a lack of newly acquired spacers. Lysogens 
were determined by broad resistance to both phage anda positive PCR 
amplification of the c-repressor gene in bacterial genome, amplified 
using primers 5’-GCGGAATGAGCGCTAAACC-3’ and 5’-CAAGTGCT 
TTAGCGAGGAATGC-3’. CRISPR resistance was confirmed by resistance 
to DMS3vir, but not to DMS3vir acriF1 and a PCR confirming spacers 
had been added to one of the CRISPR arrays. Primers 5’-CTAAGCCT 
TGTACGAAGTCTC-3’ and 5’°-CGCCGAAGGCCAGCGCGCCGGTG-3’ were 
used to amplify CRISPR array 1, and primers 5’°-GCCGTCCAGAAGTCA 
CCACCCG-3’ and 5’-TCAGCAAGT TACGAGACCTCG-3’ for CRISPR array 
2. As a positive control for PCR, primers 5’-GCTTGCAGTTCCTCAA 
CGAG-3’ and 5’CACCAGGAAAT TCAGGTAGGG-3’ were used to amplify 
the housekeeping control gene fimV, which encodes a protein that 
is involved in pilus formation. 


Bacterial and phage titres 

Bacterial densities were determined by plating on LB agar dilutions 
of samples taken at each transfer in M9 salts (22 mM Na,HPO,; 22 mM 
KH,PO,; 8.6 mM NaCl; 20 mM NH, CI; 1mM MgSO,; and 0.1mM CaCl). 
Phages were extracted at each transfer by chloroform extraction 
(sample:chloroform 10:1, v/v), and phage titres were determined by 
spotting serial dilutions of isolated phage samples in M9 salts ona 
lawn of PA14 Acas7. 


Phage competition 

Fixed phenotypes. Competition experiments were performed in glass 
vials in 6 ml of M9 medium. Experiments were initiated by inoculating 
a1:100 dilution of different mixes of overnight cultures of the Acas7 
strain and either the surface mutant ApilA” (sm2) or BIM strain (2 spac- 
ers targeting DMS3 phage). A 1:1 mix of the phages DMS3vir and DMS3 
was added (108 PFU) and the vials were incubated at 37 °C with shaking 
for 8 h. Phages were extracted by chloroform extraction and spot as- 
says carried out to determine phage titre. To determine phage ratios 
(DMS3:DMS32vir), plaque assays were carried out by serially diluting 
phage extractions and adding 200 pl of the selected dilution to 600 ul 
of Acas7 overnight culture and 6 ml molten top agar (0.5%), which was 
then poured over a prewarmed LB agar plate. The plates were incubated 
overnight at 37 °C and the plaques generated by temperate and viru- 
lent phages were discriminated by differences in opacity, and a subset 
confirmed by PCR. All experiments were performed in six replicates. 


Coevolution competitions. Phage competition was also measured 
inthe presence of host evolution. Initially sensitive Acas7 or wild-type 
hosts were infected with 10* PFU of a50:50 mix of temperate and viru- 
lent phages (DMS3 and DMS3vir). The experiment was run as a standard 
evolution experiment, and bacterial and phage titres measured every 
day for seven days. Resistance phenotypes were assessed on days 3 
and 7, as described in ‘Determination of resistance phenotypes’, and 
plaque assays used to determine the ratio of temperate to virulent 
phage at each time point. 


Competition DMS3 x DMS3 acrIF. Temperate DMS3 was also com- 
peted against a mutant encoding Acr proteins. Initially sensitive Acas7 
or wild-type hosts were infected with 10* PFU of a50:50 mix of DMS3 and 
DMS3 acrIF1 (or DMS3 acr/F4). The experiment was run for three days 
and samples of total phage population (thatis, free phages + prophages, 
no chloroform extraction) were collected and immediately frozen 
every day. Phage titres were measured every day by spot test. To deter- 
mine their relative fitness, the relative frequencies of each phage were 
determined at t= 0 and at t=3 days by qPCR, following a previously 
described method”. 


Expression of priming spacers 

The expression of the CRISPR RNA encoding spacer 1 of CRISPR2 was 
restored in strain PA14 ACRISPR2 using an arabinose inducible expres- 
sion vector (DHERD30T-based). In brief, oligonucleotides containing 
the sequences ofa non-targeting (NT) spacer (5’°-GTCTTCTT TGAGCTT 
CCAGAGAACTGAAGAC-3’) or of spacer 1 (5’-ATCAGCCGGACGT 
TGTAGTAGTCGAGCGCGGT-3’) and flanked by two CRISPR2 repeats 
were annealed and ligated between Ncol/HindIll restriction sites. The 
resulting plasmids were transformed into PA14 ACRISPR2 to generate 
the strains ACRISPR2-sp1 and ACRISPR2-NT, respectively. Similarly, a 
spacer targeting the natural Pf5 prophage of PA14 (accession number 
AY324828) with 1 mismatch (bold) 5’-AGTCCTTCTAGTGAGCGGAA 
CCAAAATCTATT-3’ was expressed in the wild-type PA14 strain. 


Measuring prophage induction 
Single colonies were picked from plated lysogens and grown in LB 
medium overnight at 37 °C with180 rpm shaking. The overnight culture 


was diluted 1:100 in fresh M9 medium and grown until OD goo nm reached 
about 0.1. The cultures were then pelleted by centrifugation and the 
pellet washed 5 times in M9 buffer, and then resuspended in 10 ml of 
M9 medium. The cultures were grown at 37 °C with shaking, samples 
were taken regularly for phage quantification by spot assay and ODgoonm 
measurements using the Biotek synergy 2 plate reader. 


Twenty-four-hour growth curves 

Single colonies were isolated and grown in M9 medium overnight at 
37 °C with 180 rpm shaking. The following day, this culture was diluted 
1:100 and 250 ul of this mixture was added to a 96-well plate and growth 
curves were measured for 23 hina Thermo Scientific Varioskan flash 
plate reader with continuous shaking at 180 rpm. Readings of ODgoonm 
were taken every 15 min and the plate kept at 37 °C. All growth curves 
were performed in 6-12 replicates. 


Bacterial competition 

Competition experiments were performed in 6 ml M9 medium supple- 
mented with 0.2% glucose. Competition experiments were initiated by 
inoculating 1:100 from a1:1 mixture of overnight cultures (grownin M9 
medium + 0.2% glucose) of each strain. If phages were included, they 
were added at aconcentration of 10* PFU. Cells were transferred 1:100 
daily into fresh broth. At varying time points, from 1to 7 days, samples 
were taken and cells were serially diluted in M9 salts and plated on LB 
agar supplemented with 50 pg mI X-gal (to enable discrimination 
between strains that carry the lacZ gene (blue) and those that do not 
(white)). Allexperiments were performed in six replicates. Relative fit- 
ness was calculated from changes in the relative frequencies of blue and 
white colonies (relative fitness = [(fraction strain A at ¢=x) x (1- (frac- 
tion stain A at t=0))]/[(fraction strain A at t=0) x (1- (fraction strain 
Aatt=x))]). 


Whole-genome sequencing and bioinformatic analyses 

Lysogen clones in the Acas7 background and lysogen clones in wild- 
type and Acas1 backgrounds were isolated from late time points 
of the evolution experiments. Standard genome sequencing and 
standard bioinformatic analyses were provided by MicrobesNG (as 
described at http://www.microbesng.uk). Raw read sequencing data 
has been deposited in the European Nucleotide Archive under the 
study accession number PRJEB34503. Trimmed reads were mapped 
to wild-type PA14 reference genome (accession number NC_008463) 
with Geneious 9.1.8 software using Bowtie2 mapper® to identify 
the genomic deletion. Reads were also mapped to DMS3 reference 
genome (accession number DQ631426) and hybrid reads composed 
of a5’ extremity deriving from PA14 and a3’ extremity matching the 
DMS3 5’ end (and vice versa, read 5’ extremity matching DMS3 3’ end 
and 3’ extremity deriving from PA14) were extracted. These hybrid 
reads were then mapped back to PAI14 genome to identify prophage 
insertion sites. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Source Data associated with Figs. 1-4 and Extended Data Figs. 1-3, 5, 
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Extended Data Fig. 1| Infection with a50:50 mix of temperate:virulent 
phages. a, b, Bacterial (a) and phage (b) titres during a co-culture experiment 
of wild-type PA14 (red) or Acas7 mutant (blue), and a 50:50 mix of DMS3 and 
DMS3vir.c, d, Resistance phenotypes at day 3 (c) or day 7 (d) of the co-culture 
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experiment, based on 24 random clones per replicate experiment. Dataare the 
mean of six biological replicates per treatment. Error bars represent 95% 
confidence intervals. 
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Extended Data Fig. 2| The suppression of lysogeny and immunopathological 
effects are due to spacer 1 of CRISPR array 2. a, b, Phage (a) and bacterial (b) 
titres during co-culture of phage DMS3 and P. aeruginosa PA14 ACRISPR2, 
expressing anon-targeting spacer froma plasmid (ACRISPR2-NT) or the 
original CRISPR2 spacer 1 (ACRISPR2-sp1).c, d, The proportion of lysogens (c) 
and the frequency of loss of CRISPR-Cas immune systems (d) at land 3 days 
post-infection, based on PCR analyses of 24 random clones per replicate 
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experiment. a-d, Data are the mean of three biological replicates. Error bars 
represent 95% confidence intervals. e-g, Growth of three independent lysogen 
clones isolated at three days post-infection, as determined by OD¢oonm 
measurements. ACRISPR2-NT (e) and ACRISPR2-sp1 (f) lysogen clones carry 
the ancestral ACRISPR2 CRISPR-Cas immune system, whereas the ACRISPR2- 
spl (g) lysogen clones have evolved to lose CRISPR-Cas. 
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Extended Data Fig. 3| Prophage induction rates are increased in hosts with 
active CRISPR- Cas. a, The percentage of lysogens formed upon infection of 
wild-type host with DMS3 phages engineered to produce AcrIF1 or AcrIF4 anti- 
CRISPR proteins. b, c, Optical density (b) and phage titres (c) during growth of 
lysogens of DMS3, DMS3 acrIFl or DMS3 acr/F4 phages ina wild-type PA14 or 
Acas7 genetic background. d, Relative fitness of the DMS3 phage during 
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competition with the virulent mutant DMS3vir in the presence of varying 
fractions of sensitive (Acas7) host and resistant hosts with CRISPR-based 
immunity (BIM) or surface-based immunity (sm2) against these phages. Data 
show mean fitness at 8 h after infection. All panels show the mean of six 
biological replicates and error bars represent 95% confidence intervals. 
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Extended Data Fig. 4| See next page for caption. 
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Extended Data Fig. 4 | Lysogens lose their CRISPR-Cas immune systems. 

a, PCRamplification of the c-repressor gene of the prophage (c-rep, 611 bp), the 
fimV gene (located about 1 Mb from the CRISPR loci and used asa positive 
control for the PCR, 116 bp) and CRISPR loci 1 (349 bp) and 2 (206 bp) onthe host 
genome. PCRs were performed on 6 independent DMS3 lysogens in wild-type, 
Acasiand Acas7 backgrounds isolated at 1 or 7 days post-infection, as well as on 
6 independent lysogens of DMS3, DMS3 acrIF1 or DMS3 acrIF4 (wild-type 
background) isolated at 6 or 120 h after infection. Red frames indicate a failure 
toamplify a product. PCR amplifications were performed on clones isolated 
from three biological replicate experiments and produced similar results. For 
gel source data, see Supplementary Fig. 1. b, Schematic of the CRISPR-Cas 
locus of wild-type PA14, which spans aregion of around 11kb. Primers used to 
amplify regions of CRISPR arrays 1 or 2 are shownas red arrows. c-e, Whole- 


genome sequencing of DMS3 lysogens that lost their CRISPR—-Cas system (red 
frames ina) in wild-type PA14 (c), Acasi (d) or Acas7 (e) backgrounds. Graphs 
show the read coverage of the region encompassing positions 2.70-2.97 Mb of 
the wild-type PA14 genome. The CRISPR-Cas locus is indicated by agreen box 
onthex axis. A genome map depicting coding sequences (yellow arrows) is 
shown above the graphs. The region comprising 2.84-2.88 Mb includes 
sequences that are repeated elsewhere on the PA14 genome, explaining why 
reads that map to these positions are still detected insome of the deletion 
mutants. The high peak at the 3’ end of the CRISPR locus corresponds to the 
coverage of spacer 20 of CRISPR2 by reads that derive from DMS3 prophage (5’ 
and 3’ extremities of these reads map to the phage genome). Spacer 20 of 
CRISPR2 has 100% identity to DMS3 but is not immunogenic because there is 
no consensus protospacer-adjacent motif. 
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Extended Data Fig. 5| Expression of Pf5 priming spacer in P. aeruginosa biological replicates, and shaded areas correspond to 95% confidence 
PA14.a, Growth of Acas7 (dashed line) or wild-type (solid line) clones carrying intervals. b, Relative fitness of wild-type pNT or wild-type pPf5-MS during 

an expression plasmid encoding a non-targeting spacer (pNT) or aspacer competition with Acas7 pNT. Data are the mean of six biological replicates per 
targeting the PA14 natural prophage Pf5 with one mismatch (pPf5-MS), as treatment. Error bars represent 95% confidence intervals. 
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a c 
8, Acinetobacter 
Bacillus 
Clostridium 
Corynebacterium 
6 Enterococcus 
Escherichia 
= Fusobacterium 
Ss Klebsiella 
3 Lactobacillus 
x 4 Listeria 
2 Moraxella 
3 Mycobacterium 
8 Neisseria 
Pasteurella 
21 Pseudomonas 
Salmonella 
Staphylococcus 
Streptococcus 
Vibrio 
04 Xanthomonas 
Yersinia 
prophages _ control 0 1-5 0 10 20 30 40 50 60 70 80 
mismatches % primed prophages 
b d 
Acinetobacter | © 
(n=5746) |) Hibs 
Bacillus | ™™ pophages 
(n=1515) | he | control 
Clostridium | ™ | | 0 mismatches 
(n=2223) | flees a] 1-5 mismatches 
Corynebacterium | | 
(n=900) | by 
Enterococcus | ™ 
(n=2649) | [ihe 
Escherichia | “=== 
(n=9567) | [iii 
Fusobacterium | | 
(n=707) | by 
Klebsiella | © 
(n=2629) | [iho 
Lactobacillus | ad 
Lava Bas log(Count) 
isteria 
(n=6224) | iB 
Moraxella | 
(n=839) | 6 
Mycobacterium | © 5 
(n=523) | F 4 
Neisseria | * 
(n=3706) | [ee 
Pasteurella | 
(n=545) | | 
Pseudomonas | ™ 
(n=14202) | [IIIT ss 
Salmonella | “= 
(n=10783) ] [EBs 
Staphylococcus | ™ 
(n=1704) | bos 
Streptococcus _ 
(n=22640) "| III 
Vibrio | © 
(n=3268) | [ile 
Xanthomonas | ' 
(n=1145) | bows 
Yersinia | a 
(n=1902) | Mi 
T T T T 
0 5 10 15 0 1 2 3 4 5 
Count (x1000) Mismatches 


Extended Data Fig. 7| Matches between spacers and temperate phages are 
widespread. a, Total matches between non-redundant spacers (n =1,239,973) 
from 171,361 RefSeq and GenBank complete genomes and anon-redundant set 
of temperate phages (n =19,996)". The counts of perfect (0) or mismatched 
(1-5) targets are shown. Asacontrol, the temperate phages were shuffled ten 
times, while retaining the hexanucleotide content (control). b, Counts of 
spacers matching temperate phages from all genera with over 500 spacer- 
prophage matches. The total number (n) of spacer-prophage matches is shown 


for each genus in parentheses. Counts of matches are shown (0, green; 1-5 
mismatches, red). The number of temperate phages analysed is plotted 
(prophages in purple) as are the matches to shuffled prophages. The control is 
shown in blue, butis not visible because it had only 0 to 10 counts. c, The 
percentage of prophages within each genus that were targeted by self-priming 
spacers (1-5 mismatches). d, Heat map of the distribution of mismatches (0-5). 
Generaare as inb and data are shownas log(count) for each genus, as the 
number of matches varied widely between genera. 
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shown. For complete P. aeruginosa genomes, all self-targeting events were 
analysed for matches to prophages using PHASTER*™. The number of genomes 
with acr genes (acr +) and self-targeting (ST +) spacersis significantly greater 
than the number of genomes with acr genes and without self-targeting spacers 
(P=8.14 x 10°, two-sided Fisher’s exact test, n=71). 
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Extended Data Fig. 9 | Presence ofa superinfecting virulent phage does not 
alter immunopathological effects. a—c, Bacterial (a) and phage titres upon 
individual (b) or mixed (c) infection of wild-type PA14 with phage DMS3 and 
virulent phage LMA2. d, e, Resistance phenotypes evolved by bacteria against 
DMS3 upon individual (d) or mixed (e) infection. f, Frequency of loss of 


CRISPR-Cas immune systems upon infection with phage DMS3 or with both the 


phages DMS3 and LMA2, based on 24 random clones per replicate experiment. 
g, Relative fitness of wild-type PA14 during competition with PA14 Acas7 inthe 
presence or absence of phages DMS3 and LMA2. a-g, Data are the means of six 
biological replicates. Error bars indicate 95% confidence intervals. 

h-o, Simulations of population and evolutionary dynamics during infection of 
bacteria carrying CRISPR-Cas systems with a mixed population of unrelated 
virulent and temperate phages. Graphs show densities of susceptible hosts, 
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CRISPR-resistant bacteria and lysogens (h, i) and free viruses over time (j, k), as 
well as the frequencies of temperate phages ina population composed of both 
temperate and virulent types (I, m). Temperate phage can transmit both 
horizontally and vertically, whereas virulent phage can transmit only 
horizontally and can superinfect the lysogens (because temperate and virulent 
phages are unrelated). n, o, Frequencies of evolutionary loss of CRISPR-Cas 
system inthe lysogen population over time. The simulations showninh,j,1,n 
reflect ascenario in which bacteria can evolve CRISPR-based resistance against 
both phages, whereas those shown ini, k, m, oreflect a situation in which 
CRISPR-based resistance does not evolve against the virulent phage, and 
bacteria instead evolve costly surface-based resistance (as it is the casein our 
experiments). A detailed description of the simulations is provided 

inthe Supplementary Information. 
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Extended Data Table 1| Genomic deletions and prophage insertion sites in DMS3 late lysogen clones 


Deleted region Prophage insertion site(s) 
Name on 


: Proportion of 
ED Fig. 4 | Start (bp) End (bp) Length 


(bp) Position (bp) iach ak 
2,831,115! 2,938,245; 107,131] replace deleted region 

replace deleted region 

3 3 3,555,252 

2,880,477 : 2,938,285: 57,809 | replace deleted region 


replace deleted region 


2,879,871 | 2,938,280; 58,410 


WT4313 im 2,878,050 : 2,938,509 


4,453,953 

WT566° ‘nie ee: nd 

replace deleted region 
3,325,229 

replace deleted region 


WT6615 5  |2,812,027'2,938,165 


WT7AnneD1| WT_6 |2,743,638'2,938,176 


5,860,538 
2,335,454 


WT8AnneD2 _7  |2,818,271! 2,938,290 


3,079,052 
cas11114 | Acas1_1 |2,918,943!2,970,427: 51,485 | 3,079,190 
replace deleted region 
| | 4,478,625 
cas1333° Ons, Os partial 
7 : 2,730,814 


cas1222 | Acas1_2 | 2,706,504! 2,938,538; 232,035 


cas14415 2,879,882 | 2,938,523 
2 4,481,869 

replace deleted region 

2,934,176 

cas16611 Acas1_4 |2,879,874! 2,938,285 replace deleted region 

cas7114__| Acas7_1 1,120,809 

cas72211 | Acas7_2 7 410,059 

cas73313 | Acas7_3 ; 5,725,887 

cas7442 | Acas7_4 : 5,743,045 

cas7554 - 905,180 

cas7669 : 1,667,097 


cas1555° 2,902,848 2,938,280 


“The sum does not reach 100% because a proportion of hybrid reads that mapped to the PA14 genome did not enable the identification of a potential prophage insertion site. 
’Sample was contaminated with a BIM clone; sequencing data are not interpretable. 

“Only 2,607 bp of DMS3 DNA (including the c-repressor gene) are inserted in bacterial genome at position 5,834,730. 

4a mixed population, in which reads matching the CRISPR locus are still detectable but coverage is very low. 


| i ature [¢ S¢ arc | Corresponding author(s): Anne Chevallereau, Edze Westra, Clare Rollie 


Last updated by author(s): 07/11/2019 


Reporting Summary 


Nature Research wishes to improve the reproducibility of the work that we publish. This form provides structure for consistency and transparency 
in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistics 
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Human immunodeficiency virus (HIV) persists indefinitely in individuals with HIV who 
receive antiretroviral therapy (ART) owing toa reservoir of latently infected cells that 
contain replication-competent virus’ *. Here, to better understand the mechanisms 
responsible for latency persistence and reversal, we used the interleukin-15 
superagonist N-803 in conjunction with the depletion of CD8* lymphocytes in 
ART-treated macaques infected with simian immunodeficiency virus (SIV). Although 
N-803 alone did not reactivate virus production, its administration after the depletion 
of CD8* lymphocytes in conjunction with ART treatment induced robust and 
persistent reactivation of the virus in vivo. We found viraemia of more than 60 copies 
per mlin all macaques (n= 14; 100%) and in 41 out ofa total of 56 samples (73.2%) that 
were collected each week after N-803 administration. Notably, concordant results 
were obtained in ART-treated HIV-infected humanized mice. In addition, we observed 
that co-culture with CD8*' T cells blocked the in vitro latency-reversing effect of N-803 


on primary human CD4’ T cells that were latently infected with HIV. These results 
advance our understanding of the mechanisms responsible for latency reversal and 
lentivirus reactivation during ART-suppressed infection. 


HIV remains a major global health problem, leading to approximately 
1.1 million deaths worldwide annually®. Despite the major declines in 
morbidity and mortality associated with the use of ART, there is neither 
a vaccine nor acure for HIV infection. The inability to eradicate HIV 
infection with the current therapies is due to the presence of latently 
infected cells that contain integrated replication-competent virus 
that persist indefinitely in HIV-infected individuals undergoing ART 
and contribute to rebound viraemia when therapy is discontinued 
(thatis, the viral reservoir)’ *. A key paradigm in the field of HIV cure— 
referred to as ‘shock and kill®’—supposes that the induction of virus 
expression (that is, ‘virus reactivation’) in these latently infected cells 
(‘shock’) followed by immune-mediated clearing (‘kill’) may substan- 
tially reduce the reservoir size and possibly lead to a functional cure 
for HIV infection. Unfortunately, no latency-reversing agent (LRA) 
tested to date has successfully perturbed the viral reservoir in clinical 
trials in humans. In particular, histone deacetylase inhibitors have not 
induced either robust virus reactivation or reduction of the viral reser- 
voir in ART-treated HIV-infected individuals®’. More encouragingly, 


in SIV-infected ART-treated rhesus macaques (Macaca mulatta), treat- 
ment with toll-like receptor 7 (TLR7) agonists was linked to transient 
blips of plasma viraemia’®. However, this result was not reproducedin 
further studies. More recently, persistent remission was observed 
inasubset of ART-treated simian—human chimeric immunodeficiency 
virus (SHIV)-infected macaques that received the TLR7 agonist GS-9620 
in combination with the broadly neutralizing PGT121 antibody”. In all, 
these published data indicate that novel, more potent approaches 
for latency reversal are needed to achieve a functional cure for HIV 
infection. 


SIV and SHIV infection of rhesus macaques 


Infection of rhesus macaques with SIV or SHIV is the most widely used 
animal model for the study of the mechanisms by which the viral res- 
ervoir is established and maintained under ART and to preclinically 
test interventions aimed at reducing the viral reservoir in vivo’. A pre- 
vious study demonstrated that the depletion of CD8* lymphocytes 


‘Emory Vaccine Center and Yerkes National Primate Research Center, Emory University, Atlanta, GA, USA. Department of Pediatrics, Emory University School of Medicine, Atlanta, GA, USA. 
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Fig. 1| Study design and phenotypic and transcriptomic effects of N-803 
with or without CD8 depletion in rhesus macaques. a, IL-15 superagonist 
N-803 structure. i.v., intravenous; TCID;9, 50% tissue-culture infectious dose. 
b, Study design. Arrows indicate the administration of 100 pg kg 1 N-803 
(green) and50 mg kg‘! MT807RI1 (blue). c, Plasma viral load pre-intervention 
(n=35 macaques), including infection and initiation of ART (grey bar). Limit of 
detection is 60 copies of SIV RNA per ml of plasma (black bar). d, Mean 
peripheral CD4* T cell (maroon), CD8* T cell (purple) and NK cell (grey) countin 
the lymph node. e, Percentage of CD4* and CD8’ T cells in the lymph node. 

f, Ki-67 expression in cellular subsets after intervention with N-803 

(n=7 macaques). g, Ki-67 expression in bulk CD4* T cells after treatment with 
only N-803 (green, n=7 macaques), CD8 depletion alone (blue, 


in SIV-infected ART-treated macaques was consistently followed by 
increased plasma viraemia, thus indicating that these cells contribute 
to viral suppression under ART“. Although the precise mechanisms 
responsible for this observation remain unclear, phylogenetic analysis 
of the rebounding virus suggested that silencing of virus transcription 
contributes to this antiviral effect. On the basis of these observations, 
we hypothesized that the depletion of CD8* lymphocytes combined 
with LRAs may enhance virus production under ART. As shown in Fig. 1a, 
the IL-15 superagonist N-803 is a complex ofa mutant IL-15 anda dimeric 
IL-15 receptor aSu/Fc fusion protein”. The engineered structure is at 
least 25 times more biologically potent than IL-15 as it mimics transpre- 
sentation, and the lgG-Fc component confers improved in vivo safety 
and bioavailability*°”. In the setting of ART-suppressed lentiviral infec- 
tion, N-803 may target the residual virus pool through its ability to 
act in vitro as a potent LRA and to strengthen the antiviral immune 
responses mediated by T and natural killer (NK) cells”. 

The current study included a total of 35 SIV-infected macaques that 
started ART at day 56 after infection and were treated for at least 1 year 
before any further intervention. The macaques were divided in three 
groups as follows (Fig. 1b): 7 macaques were treated with 4 weekly doses 


n=14 macaques), and CD8 depletion combined with the administration of 
N-803 (red, n=14 macaques). h, Gene-set enrichment analysis of RNA- 
sequencing data from bulk CD4*T cells comparing gene sets enriched on day3 
after intervention with N-803 alone (green, n=7 macaques), CD8 depletion 
alone (blue, n=7 macaques) or CD8 depletion combined with N-803 treatment 
(red, n=7 macaques). i, Heat map of the enriched genes in bulk CD8° T cellsin 
the IL-2 and STAT-5 signalling gene set after administration of N-803 alone 
(n=7 macaques). Data are mean+s.e.m. Two-sided Kruskal-Wallis tests (d, f) 
and Friedman tests (e, g) were used to compare values after the interventions 
to the pre-intervention baseline and approximate P-value summaries are 
provided. 


of 100 pg kg“ of N-803 (group 1, N-803 alone), 14 macaques received 1 
dose of the CD8-depleting antibody, MT807R1 (anti-CD8a) at 50 mg kg? 
intravenously (group 2, CD8 depletion alone) and 14 macaques received 
4 weekly doses of N-803 starting at the time of CD8 depletion (group 3, 
CD8 depletion with N-803). After reconstitution of CD8’ T cells (defined 
as more than 100 CD8’T cells per pl of blood), 7 macaques in groups 2 
and 3 received four additional weekly administrations of N-803. Periph- 
eral blood samples, lymph node and rectal biopsies were collected 
at various time points before, during and after these interventions. 
All macaques underwent analytical treatment interruption at week 3 
after either CD8* T cell reconstitution or the last N-803 treatment. As 
shown in Fig. 1c, all macaques showed suppression of viraemia after 
lyear of ART, with plasma viral loads below the detectable limit of 
our standard assay (60 copies per ml of plasma)” at the time of the 
additional interventions in 33 out of 35 macaques (94.3%). We also 
measured residual plasma viraemia using an ultrasensitive assay (limit 
of detection of 3 copies per ml)”*” at three monthly sampling points 
before the interventions. Viraemia was below 3 copies per mlin19 out 
of 35 ART-treated macaques (52.3%), and 26 out of 35 macaques (74.3%) 
showed levels of residual viraemia of <10 copies per ml at the time of 
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Fig.2|SIV and HIV reactivation after CD8 depletion combined with N-803 
treatment. a-—c, Plasma viral loads after intervention with N-803 alone 

(n=7 macaques) (a), CD8 depletion alone (n=14 macaques) (b) or CD8 
depletion combined with N-803 (n=14 macaques) (c).d, Longitudinal plasma 
viral loads for macaques with fully suppressed viral load (<3 copies per ml of 
plasma) before CD8 depletion combined with administration of N-803. 

e, f, Comparison of viral load pre-intervention (pre), post-intervention when 
CD8* T cells are <100 cells per pl blood (post) and during CD8* T cell 
reconstitution >100 cells per jl blood (CD8 return) in macaques that 
underwent CD8 depletion alone (n=14 macaques) (e) or CD8 depletion with 
N-803 (n=14 macaques) (f). Data are mean +s.e.m. g, Plasma viral loads after 
CD8 depletion with N-803 administration in SHIV¢,,¢9p3-infected macaques 


intervention (Extended Data Table 1). These results indicate that the 
level of virus suppression observed in our cohort of macaques was in 
most cases comparable to that of long-term ART-treated HIV-infected 
individuals” *’. 

As shown in Extended Data Fig. 1la—d and consistent with previous 
studies'®*°, treatment with anti-CD8a MT807R1, with or without N-803, 
depleted on average 99.1% of CD3*CD8*' T cells in peripheral blood, 
97.9% in lymphnodes and 99.5% in rectal biopsies. In addition, treatment 
with MT807R1 alone depleted 93.2% of NK cells in peripheral blood 
(Extended Data Fig. le). As expected on the basis of previous studies, 
N-803 administration alone resulted in the expansion of CD8* T cells 
inthe blood and lymph nodes (Fig. 1d, e), as well as increased prolifera- 
tion of peripheral CD8* T cells, CD4* T cells and NK cells (Fig. 1f). Of 
note, although CD8 depletion alone did not result in a rapid increase 
in CD4* T cell proliferation (as measured by Ki-67 expression), the 
combination of CD8 depletion and treatment with N-803 led to a sig- 
nificant increase in CD4' T cell proliferation (Fig. 1g and Extended Data 
Fig. 2r-v). The frequency of CD4* T cells co-expressing Ki-67 and the 
HIV and SIV coreceptor CCR5—potential target cells for infection—was 
significantly increased across all groups by the third week after inter- 
vention (Extended Data Fig. 21). Additionally, CD8 depletion with or 
without N-803 administration resulted in the expansion of effector 
memory CD4* T cells and increased PD-1 expression on CD4* T cells 
(Extended Data Fig. 2f, g). 

To better characterize the biological effects of N-803, we conducted 
atranscriptional analysis using RNA sequencing of sorted CD4" T cells 
collected before intervention and day 3, week 2 and week 4 after the 
start of intervention. Regardless of concurrent CD8 depletion, N-803 
induced a significant upregulation of gene sets associated with cell 
cycling and proliferation, activation, antiviral responses and cell signal- 
ling (Fig. 1h). In CD8* T cells, N-803 resulted in significant enrichment of 
genes in the IL-2 and STAT-5 signalling gene set, whichis also indicative 
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after 6 months of ART (n=5 macaques). Viral suppression of <60 copies per ml 
is shownasa black bar (a-d, g) or dashed line (e, f) and the limit of detection for 
these assays was 3 copies per ml of plasma. h, RNAscope determination of the 
percentage of SIV RNA‘ lymph node cells expressing high levels (>4 copies) of 
viral RNA per cell 1 week after CD8 depletion combined with N-803 treatment. 
i-k, Plasma viral loads of HIV-infected, ART-treated humanized mice treated 
with N-803 alone (green, n=7 mice) (i), CD8 depletion alone (blue, n=8 mice) 
(j) or CD8 depletion with N-803 administration (red, n=8 mice) (k). The limit of 
detection was 346 copies per ml. Statistical significance was calculated using a 
two-sided Kruskal-Wallis test (e, f) or Wilcoxon signed-rank test (h). A key for 
the macaque ID codes is provided in Extended Data Fig. 5. 


of IL-15 signalling, as the receptor for this cytokine shares two out of 
three subunits with IL-2 and uses STAT-5 as the key adaptor molecule 
(Fig. 1i). In addition, we examined the expression of 25 genes specifi- 
cally involved in the host-virus interaction during SIV infection and 
found that N-803 administration induced a consistent and transient 
upregulation of APOBEC3 in CD4* T cells, CD8* T cells and NK cells 
(Extended Data Fig. 3a-c). 

As shown in Fig. 2a and Extended Data Table 1, administration of 
N-803 was not associated with an increase of plasma viraemia of 
>60 copies per ml in any of the treated macaques, indicating that the 
IL-15 superagonist is not sufficient to exert an in vivo LRA effect in ART- 
treated SIV-infected macaques when used alone. As expected on the 
basis of previous studies, macaques undergoing CD8 depletion alone 
showed a moderate but significant increase in virus production, with 
plasma viraemia of >60 copies per ml detected in 11 out of 14 macaques 
(78.6%) and 18 out of 56 samples (32.1%) collected weekly after CD8 
depletion (Fig. 2b). Viraemia of >1,000 copies per ml was observed in 
2 out of 14 macaques (14.2%) and 2 out of 56 (3.6%) of the same samples 
(Fig. 2b). In all cases, the level of virus production returned to below 
60 copies per ml of plasma at the time of CD8' T cell reconstitution 
(Fig. 2e). Overall, the level of increased viraemia observed in this study 
was consistent with previous studies’’, even though the magnitude of 
virus production after CD8 depletion was slightly less marked, possibly 
related to the longer period of ART treatment (12 months compared 
with 2-8 months)’. Notably, macaques treated with N-803 during 
CD8 depletion showed highly robust and persistent levels of virus 
production, with viraemia of >60 copies per ml detected in 14 out of 
14 macaques (100%) and 41 out of 56 samples (73.2%) and viraemia of 
>1,000 copies per ml observed in 6 out of 14 macaques (42.9%) and 
13 out of 56 samples (23.2%) (Fig. 2c). We emphasize that all seven 
macaques with full suppression of virus production at the time of 
intervention with CD8 depletion and N-803 administration (that is, 
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Fig. 3|In vitroco-culture of latently infected human CD4'T cells with 
autologous CD8‘T cells results in decreased expression of HIV Gag during 
LRA administration. a, Schematic of HIV latency model. Memory CD4' T cells 
(mCD4"*) were enriched on day 0, infected in vitro on day 3 with HIV,,,,and 
maintained in saquinavir. On day 6, TGFB, IL-7, conditioned medium fromthe 
H-80 feeder cell line and additional antiretroviral drugs were added tothe 
culture. Cryopreserved autologous peripheral-blood mononuclear cells 
(PBMCs) were thawed on day 8 and rested overnight before enriching for total 
CD8* (tCD8’*) cells and then TCR-activated for 3 days. On day 12, HIV-infected 
mCD4* and TCR-activated total CD8* cells were co-culturedina1:lor 1:5 ratio in 
the presence of anti-CD3/CD28 antibody, N-803 or recombinant IL-15 until day 
15. b, HIV Gag*CD4' T cell frequency was quantified using flow cytometry and 
the frequency in co-cultures was calculated as a fold change compared to CD4 
Tcell monoculture. Each colour represents a unique donor (n=8 biologically 
independent samples) and data are mean +s.e.m. (indicated by the grey bars). 
Statistical significance was measured using a matched one-way analysis of 
variance (ANOVA). 


repeated viral load measurements <3 copies per ml of plasma) dem- 
onstrated clear virus reactivation with detectable levels in 26 out of 
28 time points 1 week after each N-803 administration, and >60 cop- 
ies per mlin 16 out of 28 of the same samples (Fig. 2d). Similar results 
were obtained in a smaller pilot study in which N-803 administration 
during CD8 depletion was performed in five ART-suppressed SHIVe-169p3- 
infected macaques (Fig. 2g). The level of viraemia observed in SIV- 
infected macaques treated with combined CD8 depletion and N-803 
administration during long-term ART is higher and more persistent 
than the results of previous shock-and-kill cure strategies tested in 
humans and nonhuman primates® 371, 

After the last treatment with N-803, the level of viraemia rapidly 
decreased coincident with the reconstitution of CD8’ T cells, and all 
macaques returned to <60 copies per ml by week 6 after CD8 deple- 
tion and N-803 administration (Fig. 2c, f). As expected, CD8* T cell 
reconstitution was faster in CD8-depleted macaques co-treated with 
N-803 (Extended Data Fig. 5e-g), as the IL-15 superagonist enhances 
CD8° T cell proliferation (Fig. 1d-f). 

We next investigated the correlates of virus reactivation in ART- 
treated SIV-infected macaques that underwent CD8 depletion with 
N-803, and observed that the post-depletion viral load (day 3 to week 6) 
was negatively correlated with the frequency of peripheral CD8’ T cells 
(Extended Data Fig. 5b). Additionally, the area under the curve of virus 
production during CD8 depletion and administration of N-803 was 
directly correlated with pre-intervention viraemia (Extended Data 
Fig. 5d). Of note, no correlation was found between the level of virus 
production after CD8 depletion and/or N-803 treatment and either the 
size of the peripheral blood DNA reservoir (measured as the fraction 
of SIV DNA*CD4* T cells) or the level of CD4* T cell activation (meas- 
ured as Ki-67 or PD-1 expression on CD4* T cells) (data not shown). To 


assess whether combined CD8 depletion with N-803 administration 
induced SIV production in lymphoid tissues, we next analysed the levels 
of SIV RNA inthe lymph nodes using the RNAscope technology at pre- 
intervention and day 7 after intervention in5 representative macaques 
that were both depleted of CD8* lymphocytes and treated with N-803. 
As shown in Fig. 2h, and consistent with the measurements of plasma 
viraemia, we found a statistically significant increase in the percent- 
age of SIV RNA‘ cells with high levels of SIV RNA after intervention. 
No changes were observed in the level of SIV RNA in peripheral CD4* 
T cells (Extended Data Fig. 4a—c), suggesting that lymphoid tissues are 
the main source of reactivated virus after the combined CD8 depletion 
and administration of N-803. 


HIV infection of humanized mice 


To confirm the virus reactivation induced by the combined depletion 
of CD8* lymphocytes and administration of N-803 in SIV-infected ART- 
treated macaques in an in vivo model using HIV, we next conducted 
a similarly designed experiment using bone-marrow-liver-thymus 
(BLT) humanized mice infected with HIV-1jp.csp and treated with ART. 
As shown in Fig. 2i-k, HIV-infected humanized mice showed markedly 
similar results to those obtained in SIV-infected macaques, with no 
plasma virus reactivation after the administration of N-803 alone, a 
moderate level of virus reactivation after CD8 depletion alone, and a 
robust level of virus reactivation in 7 out of 8 (87.5%) humanized mice 
that were depleted of CD8* lymphocytes and treated with N-803. Fur- 
thermore, we also noted astatistically significant increase in the levels 
of cell-associated HIV RNA inthe spleen and human-derived thymus of 
humanized mice that were depleted of CD8* lymphocytes and treated 
with N-803 (Extended Data Fig. 6b). 


CD8° T cells inhibit latency reversal in vitro 


The combined data obtained in SIV-infected macaques and HIV-infected 
humanized mice indicate that the strong virus reactivation activity 
attributed to N-803 is revealed only inthe absence of CD8' T cells, thus 
delineating a novel mechanism of latency maintenance and/or inhibi- 
tion of latency reversal mediated by CD8* lymphocytes. To recapitu- 
late this observation in a reductionist in vitro model of HIV latency in 
human cells, we used the recently developed latency and reversal assay 
(LARA)* to evaluate how CD8*' T cells affect the virus reactivation activ- 
ity of N-803 in autologous memory CD4* T cells latently infected with 
HIV 9, (Fig. 3a). Of note, this assay was conducted using cells derived 
from HIV-negative donors, thus HIV-specific cytotoxic CD8’ T cells were 
absent. Whereas N-803 (and its biological counterpart, IL-15) reactivates 
HIV expression in latently infected CD4* T cell monocultures, co-culture 
with activated CD8* T cells significantly suppresses this ability (Fig. 3b). 
These data indicate that CD8" T cells effectively suppress the latency- 
reversing activity of N-803, and therefore confirm the discovery of a 
previously unrecognized CD8* T-cell-mediated activity that contributes 
to the maintenance of lentivirus latency in vivo in primates. 


Virus sequence analysis 


To investigate the viral dynamics associated with reactivation after 
CD8 depletion and administration of N-803 in SIV-infected ART-treated 
macaques, we performed a longitudinal sequence analysis of plasma 
virus using single-genome PCR amplification of the SIV,,4-739-derived 
env genes. The viral env was sequenced at three pivotal time points: 
day 7 after infection (at peak viraemia), day 56 after infection—that 
is, immediately before ART initiation (pre-ART)—and during peak 
virus reactivation. We conducted this analysis in six macaques that 
showed robust virus reactivation (plasma viraemia of >800 copies 
per ml) (Fig. 2c). Extended Data Figure 7 shows phylogenetic analysis 
of the translated Env amino acid sequences of the circulating viruses. 
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Fig. 4| CD8 depletion combined with N-803 administration does not 
decrease the size of the latent SIV viral reservoir. a—f, Copies of cell- 
associated SIV DNA per 10° cells was determined in peripheral CD4* T cells (a-c) 
and frozen bulk lymph node cells (d-f) in macaques treated with N-803 alone 
(n=7 (a) andn=3 (d) macaques), CD8 depletion alone (n= 14 (b) andn=5(e) 
macaques), and CD8 depletion with N-803 treatment (n=14 (c) andn=3 (f) 
macaques). Two-sided Friedman tests (a-c) and Wilcoxon signed-rank tests 
(d-f) were used to determine statistical significance between pre-treatment 


The peak viral load Env sequences were homogeneous and nearly 
identical to the input SIV,,,,.239 sequence; however, the diversity and 
number of informative sites at subsequent time points were limited, 
such that sequences could not be clustered on the basis of time point 
with significant bootstrap support. The diversity at each time point, 
although limited, was quantified by determining the number of amino 
acid differences from the input SIV,,,,.039 Sequence. Extended Data Fig- 
ure 8a shows that, for all macaques, the peak viraemia Env sequences 
are the least different from the input virus, as expected, and increased 
divergence was observed at the pre-ART time point and after reactiva- 
tion. We next calculated the average number of amino acid differences 
from SIV, nac239 at each time point, and compared these with contempo- 
raneous plasma viraemia for each macaque in a correlation matrix. 
The only significant association was a direct relationship between Env 
divergence and plasma viraemia during reactivation (Extended Data 
Fig. 8b). Finally, Extended Data Fig. 9 shows the location of sequence 
changes at each time point using highlighter plots of the Env amino 
acid sequences. Overall, this analysis supports the hypothesis that CD8 
depletion and administration of N-803 induces robust reactivation of 
a diverse population of viral variants. As no signs of virus evolution 
emerged from the longitudinal sequence analysis during high levels 
of latency reversal, it is unlikely that the rebounding virus is a product 
of de novo viral replication. Supporting this hypothesis, the combined 
CD8 depletion and administration of N-803 did not increase the levels 
of two-long-terminal repeat circles—which are considered a marker 
of recent lentivirus infection—in peripheral-blood mononuclear cells 
(data not shown). 


Analytic treatment interruption 


To determine whether the interventions used induced a decrease in 
the virus reservoir, we first longitudinally measured the level of cell- 
associated SIV DNA in blood and lymph nodes. As shown in Fig. 4a-f, 
none of the experimental groups showed significant changes in either 
the total fraction of circulating CD4*T cells or the calculated fraction 
of lymph-node-derived cells that contained SIV DNA. To functionally 
assess the effect of the treatment regimens on the reservoir size, we 
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and post-treatment time points. For all comparisons, P>0.05.Sample 

mean +s.e.m. are indicated by grey bars. g-i, Viral rebound after interruption 
of ART (indicated by arrowheads) in macaques that received N-803 alone 

(n=7 macaques) (g), CD8 depletion alone (n=13 macaques) (h) or CD8 
depletion with N-803 (n=14 macaques) (i). ATI, analytical treatment 
interruption. The limit of detection was <3 copies of SIV RNA per ml plasma. 
The black bar indicates viral loads <60 copies per ml and the grey box indicates 
ART. 


performed an analytical treatment interruption in all macaques three 
weeks after either CD8 reconstitution and/or the last N-803 treatment. 
As shown in Fig. 4g-i, all macaques rebounded within three weeks of 
ART interruption and most macaques sustained high viral loads until 
the time of necropsy. It should be noted that in the current study, ART 
was initiated at day 56 after infection, thus substantially later than in 
other published macaque studies that included analytical treatment 
interruption, and thus in the setting of a larger and more-disseminated 
reservoir'®’, The rapid rebound after ART interruption was therefore 
not unexpected as the experimental design was focused on assessing 
the shock effect of CD8 depletion combined with N-803 administra- 
tion, with no anticipated effect on the reservoir size in the absence of 
an intervention aimed at clearing the cells that have reactivated SIV 
production (kill phase of the shock-and-kill approach). The absence of 
a decrease in the level of SIV DNA‘ cells after the combined CD8 deple- 
tion and administration of N-803 may be due to the lack of CD8* T-cell- 
mediated clearance of cells that have reactivated virus expression and/ 
or the N-803-mediated proliferative expansion of infected CD4*T cells 
that have survived the events of virus reactivation. 


Discussion 


The current paradigm for shock-and-kill interventions for an HIV cure 
suggests that reactivation of virus transcription in latently infected cells 
is the first essential step to eliminate the persistent reservoir of repli- 
cation-competent virus in ART-treated HIV-infected individuals. In this 
study, we have shown that the administration of the IL-15 superagonist 
N-803 in both SIV-infected macaques and HIV-infected humanized mice 
induces a highly robust and persistent reversal of latency only in the set- 
ting of CD8* lymphocyte depletion, thus suggesting a substantial role 
for CD8* lymphocytes in suppressing the LRA effect of N-803. Notably, 
this previously undescribed role of CD8* lymphocytes in maintaining 
virus latency was fully reproduced in an in vitro experimental approach 
that involved the co-culture of activated, unprimed CD8*° T cells with 
autologous, latently HIV-infected human primary CD4* T cells. Here 
we use a novel approach to manipulate latently infected cells and have 
independently confirmed this method in the two best-validated and 


most widely used in vivo models for HIV cure interventions and then 
recapitulated the results in an in vitro experimental system of virus 
latency. In addition to this conceptual advance, this study defines a 
robust shock approach that could provide a key experimental system to 
directly compare and contrast the efficacy of different killinterventions 
in vivo, including those that may act ina CD8* T-cell-independent man- 
ner (suchas neutralizing antibodies, CD4 mimetics or immunotoxins). 
Further studies that aim to identify the specific molecular pathways 
used by CD8* T cells to promote latency may allow the suppression of 
this activity, and therefore enable the full use of the virus-reactivating 
potential of N-803 or other LRAs in the clinical setting without deplet- 
ing CD8* lymphocytes. 
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Methods 


Rhesus macaques, SIV-infection, ART, CD8 depletion and 
administration of N-803 

This study was conducted using a cohort of 35 Indian-origin rhesus 
macaques housed at Yerkes National Primate Research Center (male 
and female, 2-3 years of age at the start of the study). All macaques were 
Mamu*BO7 and Mamu*BI17 ; the following macaques were Mamu*AOI': 
77 13, RFr15, 208 13, RPb16, RJt15, RHv15, RAu15, RNal6, RNz15, ROr15, 
RRb16, RSt15, RAk16, RUs15, RYe16, RVZ15, RBn16 and RRn16. All proce- 
dures were approved by the Emory University Institutional Animal Care 
and Use Committee (IACUC) and animal care facilities are accredited by 
the US Department of Agriculture and the Association for Assessment 
and Accreditation of Laboratory Animal Care International. 

Rhesus macaques were infected intravenously with 3,000 TCID,, of 
SlVnaco39 (Nefopen). SIV naco39 Stock was titrated in vitro for viral infectiv- 
ity by standard end-point titration on CEMx174 cells. The TCID;, was 
calculated using a previously published method™. All macaques were 
put ona three-drug ART regimen at 8 weeks after SIV infection. Teno- 
fovir disoproxil fumarate (5.1 mg kg” per day or tenofovir, 20 mg kg? 
per day) and emtricitabine (40 mg kg" per day) were both provided 
by Gilead Pharmaceuticals. Dolutegravir (2.5 mg kg? per day) was 
provided by ViiV Pharmaceuticals. Drugs were administered daily by 
subcutaneous injection. 

After over 12 months of ART, macaques were assigned to intervention 
groups. Age, weight, sex, AO1 status, peak post-infection viral load and 
time to suppression after ART were all controlled for when allocat- 
ing macaques to intervention groups. Group sizes were determined 
using a power analysis based on previous studies. No blinding was 
performed in this study. One dose of the anti-CD8a-depleting antibody, 
MTS8O7R1 (50 mg kg“), was administered to 28 macaques. The initial 
15 macaques that received the depleting antibody received the admin- 
istration intravenously. Owing to safety concerns, the administration 
was changed to subcutaneous for the remaining 13 macaques. There 
was no observable effect of the different administration routes on the 
efficacy of depletion. 

Atthe start of the intervention, 21 macaques received a dose of N-803 
either in addition to CD8 depletion (n = 14) or as a single treatment 
(n=7).N-803 was administered subcutaneously ina cycle of 100 pg kg" 
once a week for 4 consecutive weeks. 

The study design included a subsequent four-dose administration 
of N-803 in seven macaques of groups 1 and 2 that was carried out at 
the time of CD8* T cell reconstitution to potentially accelerate the 
recovery of these cells and improve their antiviral cytotoxic potential. 
As expected, this second cycle of N-803 induced a faster recovery of 
CD8* T cells (data not shown) and was associated with an increase in 
T cell activation and proliferation that was similar to the increased 
activation and proliferation observed after the first N-803 cycle (data 
not shown). The late administration did not result in an increase in 
plasma viraemia. 

Macaques were interrupted of ART 3 weeks after either the last dose 
of N-803 or 3 weeks after the reconstitution of CD8* T cells (whichever 
occurred first). Plasma viral loads were monitored for about 6 months 
until necropsy was performed. 


CD8 depletion in combination with N-803 treatment in SHIV- 
infected ART-treated macaques (pilot study) 

An additional five Indian-origin rhesus macaques were included in 
this study as part of a follow-up pilot study using a SHIV model of 
infection. These macaques were also housed at the Yerkes National 
Primate Research Center and all procedures were approved by the 
Emory University IACUC. Macaques were infected intrarectally with 
high-dose SHIV¢-;¢2p3, administered as a 1:50 dilution of a 2,032 TCID;, 
per ml, 10° RNA copies per ml, 182 ng mI P27 stock. All macaques were 
placed on the same tenofovir, emtricitabine and dolutegravir ART 


regimen 12 weeks after SHIV infection. After 6 months of ART, macaques 
received one dose of MT-807R1 at 50 mg kg subcutaneously. N-803 
was administered subcutaneously ina cycle of 100 pg kg! once a week 
for 4 consecutive weeks starting at the time of CD8 depletion. 


Collection and processing of tissues from macaques 

Blood, lymph nodes and rectal biopsies were collected longitudinally 
including at the time of necropsy and processed for further analyses 
as previously described’. 


Immunophenotypes of macaques by flow cytometry 
Multiparametric flow cytometry was performed according toa stand- 
ard protocol on PBMCs and lymph node mononuclear cells using flu- 
orescently labelled monoclonal antibodies cross-reactive in rhesus 
macaques. The following antibodies were used at 37 °C for 30 min: 
CCRS APC (BD Biosciences, 560748, clone 3A9) and CCR7 FITC (BD 
Biosciences, 561271, clone 150503), in addition to LIVE/DEAD aqua 
viability dye (ThermoFisher, L35957) . The following antibodies were 
subsequently used at room temperature for 30 min: CD3 APC-Cy7 (BD 
Biosciences, 557757, clone SP34-2), CD4 BV650 (Biolegend, 317436, 
clone OKT4), CD8a BV711 (Biolegend, 301044, clone RPA-T8), pure 
CD88 (Thermo, 14-5273-82, clone SIDI8SBEE) conjugated to Pe-Cy5 via 
akit (Innova Biosciences, 760-0010), CD45RA Pe-Cy7 (BD Biosciences, 
561216, clone 5H9), CD62L PE (BD Biosciences, 341012, clone SK11), 
CD95 BV605 (Biolegend, 305628, clone DX2), PD-1 BV421 (Biolegend, 
329920, clone EH12.2H7), CD16 BV421 (BD Biosciences, 562874, clone 
3G8), CD20 PE-Cy5 (BD Biosciences, 555624, clone 2H7), CD14 PE-Cy7 
(BD Biosciences, 557742, clone M5E2), NKG2A (also known as CD159) 
PE (Beckman Coulter, IM3291U, clone Z199), CD28 PE-Cy5.5 or ECD 
(Beckman Coulter, B24027 and 6607111, respectively, clone CD28.2), 
CD56 FITC (BD Biosciences, 340723, clone NCAM16.2). Additional pan- 
els included CD69 Pe-Cy5 (BD Biosciences, 555532, clone FN5O) and 
CD25 APC (BD Biosciences, 555434, clone M-A25]1). Cells stained for 
Ki-67 were fixed and permeabilized with Perm II kit (BD Biosciences) 
before staining at room temperature for 30 min with Ki-67 AF700 (BD 
Biosciences, 561277, clone B56). 

All flow cytometry specimens were acquired on an LSR II (BD Bio- 
sciences) equipped with fluorescence-activated cell sorting (FACS) 
software (FACS Diva), and analysis of the acquired data was performed 
using FlowJo software (TreeStar). 


Determination of plasma SIV RNA, and cell-associated RNA and 
DNA 

For pre-intervention time points, quantitative PCR with reverse tran- 
scription (RT-qPCR) was performed to determine SIV plasma viral 
load as previously described” with a sensitivity of 60 copies per ml. 
For the three time points before intervention and all post-intervention 
time points, plasma SIV gag RNA levels were measured high-sensitivity 
assay formats” ”°, Quantification of total cell-associated SIV nac730 LAL 
DNA was performedas previously described®. The number of gagDNA 
copies per 10° CD4* T cells was calculated by dividing the number of 
gag DNA copies per 10° PBMCs by the percentage of CD3*CD8 CD4* 
cells in the PBMC population. CD4" T cells were isolated from PBMCs 
using a CD4' T cellisolation kit (Miltenyi) and cell-associated RNA was 
measured as previously described**. 


Insitu RNA analysis and quantification 

Viral RNA (vRNA) detection using RNAscope and quantitative image 
analysis was performed on formaldehyde-fixed, paraffin-embedded 
tissue sections (5 jm) as previously published”, with the following 
minor modifications: heat-induced epitope retrieval was performed 
by boiling slides in 1x target retrieval (322000; ACD) for 30 min, fol- 
lowed by incubation at 40 °C with a1:10 dilution of protease Ill (322337; 
ACD) in 1x PBS for 20 min. Slides were incubated with the target probe 
SlVnacz39 (312811; ACD) for 2h at 40 °C and amplification was performed 


with RNAscope 2.5 HD Detection kits (322360; ACD) according to the 
manufacturer’s instructions, with 0.5 wash buffer (310091; ACD) 
used between steps. The resulting signal was detected with Warp Red 
chromogen (WR806M; Biocare Medical). Slides were counterstained 
with CAT haematoxylin (CATHE-GL; Biocare Medical), mounted with 
Clearmount (17885-15; EMS) until dry, after which a coverslip was added 
and sealed using Permount (SP15-100; Fisher Scientific). Slides were 
scanned at 40x magnification on an Aperio AT2 (Leica Biosystems). 
RNAscope images were analysed for the total number of vRNA‘ cells per 
10° total cells (quantitative) and the relative amount of vRNA present 
(semiquantitative) using the ISH module (v.2.2) within the Halo software 
(v.2.3.2089.27; Indica Labs). The relative amount of vRNA withina single 
cell was first estimated by quantifying the total area of the vRNA signal 
spot size (jum). As the signal spot size is a function of several steps in 
the experimental procedures, module settings were established on 
concomitantly assayed, acutely infected SIV‘ control slides. To estimate 
the signal spot size of a single VRNA molecule, we measured the signal 
area (minimum, mean and maximum) of more than 10 identifiable indi- 
vidual virions within B cell follicles, which corresponds to two copies of 
vRNA, and multiplied this by 0.5. We set the VRNA minimum signal spot 
size within the analysis module to exclude detection of a single VRNA 
molecule and/or integrated viral DNA. Relative VRNA copy numbers 
that were present within vRNA‘ cells were calculated as (signal spot size 
within vRNA‘ cell (tum?))/(0.5 x mean signal size for a virion). 


FACS of live cells from macaques 

Mononuclear cells isolated from blood were stained with LIVE/DEAD 
aqua viability dye (ThermoFisher, L35957), CD3 AF700 (BD Biosciences, 
557917, clone SP34-2), CD4 BV650 (Biolegend, 317436, clone OKT4), 
CD8 APC-Cy7 (BD Biosciences, 557834, clone SK1), CD14 PB (Biolegend, 
301828, clone M5E2), CD20 PB (Biolegend, 302328, clone 2H7) and CD16 
BV421 (Biolegend, 302032, clone 3G8) for 30 min at room temperature. 
Aliquots of 50,000 CD4* T cells (live CD3*CD20°CD14- CD16 CD8 CD4*) 
and CD8‘* T cells (live CD3*CD20 CD14 CD16 CD4 CD8*) were then 
sorted using a FACS Aria II (BD Biosciences). Mononuclear cells were 
separately stained with Live/Dead stain, CD3 AF700 (BD Biosciences, 
557917, clone SP34-2), CD4 BV650 (Biolegend, 317436, clone OKT4), 
CD8 APC-Cy7 (BD Biosciences, 557834, clone SK1), CD14 PB (Biolegend, 
301828, clone MSE2), CD20 PB (Biolegend, 302328, clone 2H7) and 
NKG2A PE (Beckman Coulter, IM3291U, clone Z199) to sort aliquots of 
50,000 NK cells (live CD3°CD20-CD14-CD4 CD8*NKG2A’). 


RNA sequencing and data analysis 

Bulk CD4* T cells were sorted from fresh PBMCs before intervention, 
and on day 3, week 2 and week 4 post-intervention. In brief, RNA from 
sorted cells was collected and extracted, and the DNA was digested. 
Libraries were prepared and normalized, pooled and clustered ona 
flow cell for sequencing. RNA-sequencing data were aligned to the 
MacaM v.7.8 assembly of the Indian rhesus macaque genome. To iden- 
tify pathways that were differentially modulated, gene-set enrichment 
analysis®® was performed on the ranked transcript lists using 1,000 
phenotype permutations and random seeding. Gene sets used included 
the MSigDB H (hallmark) gene sets”. Normalized enrichment scores 
for select upregulated gene sets were used, for which the normaliza- 
tion was group-specific. A normalized enrichment score cut-off of 
greater than 1.35 for upregulated gene sets with a false-discovery rate 
of less than 0.2 was used, as per the recommendations for gene-set 
enrichment analysis. For the generation for heat maps, colours repre- 
sent log,-transformed library-size-normalized read counts scaled to 
unit variance across transcript vectors and normalized to the baseline 
median sample value of each transcript. 


Single-genome PCR amplification of SIV,,.-235 env sequences 
cDNA synthesis and 384-well single-genome PCR amplification (SGA) 
were performed using an approach similar to previously described 


methods*°”. In brief, RNA was extracted from cryopreserved plasma 
samples using the QIAmp viral RNA kit (Qiagen, 52906) and reverse 
transcription was performed using the SuperScript Ill kit (Invitrogen, 
18080-044) with reverse primer SM-ER1 (5’-CTATCACTGTAATAAATCC- 
CTTCCAGTCCC-3’). cDNA was diluted to result in less than 30% positive 
wells for SGA. First-round PCR was performed in a 15-pl volume using 
the Phusion Hotstart II High Fidelity DNA Polymerase (Thermo Scien- 
tific, F537S) with forward primer H2SM-EF1 (5’-CCCTTGAAGGMGCM- 
RGAGAGCTCATTA-3’) and SM-ER1. Cycling conditions were as follows: 
98 °C for 2 min; 10 cycles of 95 °C for 15 s, 54 °C for 60 s and 68 °C for 
4 min; 25 cycles of 95 °C for 15 s, 54 °C for 60 s and 68 °C for 4 min, 
adding 5 s to the extension per cycle; 72 °C for 30 min; and 4 °C hold. 
Second-round PCR was performed with the same enzyme ina 10-l 
volume with 1 pl of the first-round PCR reaction as template and primers 
H2SM-EF2 (5’-CACCTAAAAARTGYTGCTAYCAT TGCCAG-3’) and SM-ER2 
(5’-ATAAAATGAGACATGTCTATTGCCAATTTG-3’). Cycling conditions 
were as follows: 95 °C for 2 min; 30 cycles of 95 °C for 15s, 54 °C for 60s 
and 72 °C for 2.5 min; 72 °C for 10 min; and 4 °C hold. PCR amplicons 
were purified using a Qiaquick PCR Purification Kit (Qiagen 28106). 


Sequencing of env amplicons 

On average, 26 SGA PCR amplicons per time point (range, 20-30) were 
sequenced by Eurofins Genomic DNA Sanger sequencing using the 
following primers: SIVmac251seqF1 5’-GGGATATGT TATGAGCAGT- 
CACG-3’; SIVmac251seqF2 5’-ATCCAAGAGTCT TGTGACAAGC-3’; 
SIVmac251seqF3 5’-AAGAGAGGGAGACCTCACG-3’; SIVmac251seqF4 
5’-AGGCCAGTGT TCTCT TCC-3’; SIVmac251seqR1 5’-CTTGT TCCAA- 
GCCTGTGC-3’; SIVmac251seqR2 5’-CCTCTGCAAT TTGTCCACATG-3’; 
SIVmac251seqR3 5’-TCCAAGAAGTCAACCTT TCGC-3’; SIVmac251seqR4 
5’-AGCTGGGTTTCTCCATGG-3”8. Sequencher v.5.1 was used to generate 
nucleotide sequence contigs and sequences with mixed peaks in the 
chromatogram were excluded from further analysis. 


Sequence analysis of SIV,,.235 contigs 

Geneious v.9.1.7 was used to translate nucleotide sequences into amino 
acids and generate alignments. Amino acid alignments were exported 
from Geneious in FASTA format and used to generate highlighter 
plots to visualize amino acid mismatches (http://www. hiv.lanl.gov/ 
content/sequence/HIGHLIGHT/highlighter_top.html). Phylogenetic 
neighbour-joining consensus trees (Jukes—Cantor, resampling with 
100 bootstrap replicates) were created in Geneious using amino acid 
alignments, and were exported in NEXUS format into Figtree v.1.4.4 
for further modification (A. Rambaut, http://tree.bio.ed.ac.uk/). Phy- 
logenetic trees were presented as unrooted or were rooted on the 
midpoint. Bootstrap values that were higher than 80% are considered 
significant. Pairwise differences between the infecting SIV,,4.239 clone 
and each SGA-derived Env amino acid sequence were determined 
using Geneious. 


Experimental design of the BLT humanized mouse model 

The BLT humanized model of HIV infection was used to determine the 
efficacy of CD8 depletion alone, CD8 depletion in combination with 
N-803 or N-803 alone as aLRA. BLT mice (15-19 weeks after humaniza- 
tion surgery) were exposed to HIV-1)p.cs- intravenously. ART was initiated 
4-5 weeks later. Viraemia was durably suppressed by ART for 4 weeks. 
Asingle dose of N-803, CD8-depleting antibody or the combination of 
N-803 and CD8-depleting antibody was administered to HIV-infected 
and suppressed mice as indicated below. HIV RNA induction was meas- 
ured on days 4 and 7. 


Generation of BLT humanized mice 

BLT humanized mice were prepared as previously reported* **. In 
brief, a1-2-mm piece of human liver tissue was sandwiched between 
two pieces of autologous thymus tissue (Advanced Bioscience 
Resources) under the kidney capsule of sublethally irradiated (200 cGy) 
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12-15-week-old NOD.Cg-Prkdc™Il2rge"!"/SzJ (NSG; The Jackson Labo- 
ratory) mice. After implantation, mice were transplanted intravenously 
with haematopoietic CD34" stem cells isolated from autologous human 
liver tissue. Human immune cell reconstitution was monitored in the 
peripheral blood of BLT mice by flow cytometry every 3-4 weeks as 
previously described*? *°. Mice were maintained under specific-path- 
ogen-free conditions by the Division of Comparative Medicine at the 
University of North Carolina, Chapel Hill. Mouse experiments were 
conducted in accordance with NIH guidelines for the housing and care 
of laboratory animals and in accordance with protocols reviewed and 
approved by the IACUC at the University of North Carolina, Chapel Hill. 


Production of HIV and infection of BLT mice 

Stocks of HIV-Lpcsp Were prepared as previously reported* *. The pro- 
viral clone was transfected into human embryonic kidney (HEK)293T 
cells using Lipofectamine 2000 (Invitrogen, 11668030) following the 
manufacturer’s protocol. Viral supernatants were collected 48 h after 
transfection and titred onto TZM-bl indicator cells in triplicate to deter- 
mine the tissue-culture infectious units (TCIU) per ml. At least two 
different titre determinations were performed for each virus stock. 
BLT mice were exposed to 3 x 10* TCIU HIV-1jp.cs¢ by tail-vein injection. 


Analysis of HIV infection in BLT mice 

The peripheral-blood plasma viral load was monitored longitudinally by 
quantitative real-time PCR using a TaqMan RNA-to-C, 1-step kit (Applied 
Biosystems, 4392656). The sequences of the forward and reverse prim- 
ers and the TaqMan probe for PCR amplification and detection of HIV 
gag RNA were: 5’-CATGT TT TCAGCAT TATCAGAAGGA-3’, 5’-TGCTT- 
GATGTCCCCCCACT-3’ and 5’-FAM-CCACCCCACAAGATT TAAACAC- 
CATGCTAAQ-3’, respectively. Known quantities of HIV gag RNA were 
runin parallel, creating a standard curve for HIV gag and sample RNA 
levels were quantified by extrapolation from the standard curve. All 
samples were run and analysed on an ABI 7500 Fast Real-time PCR Sys- 
tem (Applied Biosystems). 

HIV DNA levels were measured in tissue cells collected at the time 
of collection and cryopreserved in cryopreservation medium (10% 
DMSO:90% fetal bovine serum). Cells were thawed slowly, counted 
by Trypan exclusion, aliquoted and pelleted. DNA was extracted from 
cell pellets using the QlIAamp DNA Blood Mini Kit (Qiagen) accord- 
ing to the manufacturer’s instructions. RT-PCR was performed with 
a TaqMan Fast Universal PCR Master Mix (Applied Biosystems). The 
sequences of the forward primer, reverse primer and the TaqMan 
probe for amplification and detection of HIV gag DNA were: 5’-CAT- 
GTTTTCAGCAT TATCAGAAGGA-3’, 5’-TGCT TGATGTCCCCCCACT-3’ and 
5’-FAM CCACCCCACAAGATTTAAACACCATGCTAA Q -3’, respectively. 
As acontrol, Homo sapiens haemoglobin subunit gamma-2 (HBG2) 
was included to quantify the presence of human DNA in each sample. 
The sequences of the forward primer, reverse primer and the TaqMan 
probe for amplification and detection of HBG2 were 5’-CGCTTCTG- 
GAACGTCTGAGATT-3’, 5’-CCTTGTCCTCCTCTGTGAAATGA-3’ and 
5’-FAM-TCAATAAGCTCCTAGTCCAGAC-3’, respectively. All samples 
were run and analysed on an ABI 7500 Fast Real-time PCR System 
(Applied Biosystems). 


ART administration in BLT mice 

ART was administered to BLT humanized mice as previously 
described” *’, via 12.7-mm pellets of irradiated Teklad chow con- 
taining emtricitabine (1,500 mg kg“), tenofovir disoproxil fumarate 
(1,560 mg kg”) and raltegravir (600 mg kg”) (Research Diets). 


N-803 and MT807R1 administration in BLT mice 

N-803 (0.2 mg kg‘ in PBS) and the control vehicle (PBS) were admin- 
istered to mice intravenously in a total volume of 200 pl. MT807R1 
(3mg kg ‘in PBS) and the control vehicle (PBS) were also administered 
intravenously in a total volume of 200 ul. 


Immunophenotypic analysis of BLT mice 

Immunophenotyping was performed on peripheral-blood samples 
longitudinally and at the time of collection on blood and mononuclear 
cells isolated from the tissues of BLT mice. All flow cytometry data were 
collected onaBD FACSCanto instrument using BD FACSDiva software 
(v.6.1.3) and data were analysed with FlowJo Software (v.10.5.0). Anti- 
bodies for the analysis of human immune cell levels included: CD45 
APC (BD 555485, clone HI30), CD3 FITC (BD Biosciences, 555339, clone 
HIT3a), CD4 APC-Cy7 (BD Biosciences, 560158, clone RPA-T4), CD33 PE 
(BD Biosciences, 340679, clone P67.6); CD19 PE-Cy7 (BD Biosciences, 
557835, clone SJ25C1) and CD8 PerCP (BD Biosciences, 347314, clone 
SK1). Flowcytometry gating for the expression of lineage-specific anti- 
gens on human leukocytes was performed as follows. Step 1, forward 
and side scatter were used to set a live-cell gate. Step 2, live cells were 
then analysed for the expression of the human pan-leukocyte marker 
CD45. Step 3, human leukocytes were then analysed for human CD3* 
T cells and CD19" B cells. Step 4, T cells were analysed for human CD4 
and CD8 expression. The following flow cytometry antibody panel was 
also used to analyse HLA-DR, CD38 and CD25 expression: CD3 BV421 
(BD Biosciences, 562426, clone UCTH1), CD4 BV605 (BD Biosciences, 
562658, clone RPA-T4), CD45 FITC (BD Biosciences, 347463, clone 2D1), 
HLA-DR PerCP (BD Biosciences, 347364, clone L243), CD69 PE (BD 
Biosciences, 555531, clone FN5O0), anti-CD38 PE-Cy7 (BD Biosciences, 
335790, clone HB7), CD25 APC (BD Biosciences, 340938, clone 2A3), 
CD8 APC-Cy7 (BD Biosciences, 557834, clone SK1) and AQUA (Ther- 
moFisher, L35957). Flow cytometry gating was performed as follows. 
Step 1, forward-scatter height and forward-scatter area were used to 
eliminate doublets. Step 2, side-scatter area and forward-scatter area 
were used to distinguish leukocytes based on morphology. Step 3, the 
viability dye AQUA was use to discriminate live cells from dead cells. 
Step 4, live cells were analysed for the expression of the human pan- 
leukocyte marker CD45. Step 5, human leukocytes were then assessed 
for human CD3 expression to identify T cells. Step 6, T cells were evalu- 
ated for the expression of human CD4 and CD8. Step 7, human CD4* 
or CD8* T cells were examined for expression of HLA-DR and/or CD38, 
or CD25. Gates were set with fluorescence-minus-one controls. Non- 
specific binding was assessed with isotype controls. 


Experimental design CD8 in vitro suppression assay 

Invitro latently infected memory CD4* T cells were generated using the 
LARA method as previously described” with the following modifica- 
tions. On day O, after PBMCs were isolated from buffy coats of HIV-naive 
individuals (New York Blood Center) using SepMate density-gradient 
centrifugation (StemCell Technologies, 85460), a portion of PBMCs 
from each HIV-naive donor was cryopreserved in fetal bovine serum 
(VWR Life Science Seradigm, 97068-085) and 10% DMSO, and stored in 
liquid nitrogen. On day 8, PBMCs were thawed and rested overnight in 
RPMI 1640 medium (Fisher Scientific, SH3002701.01) supplemented 
with 10% fetal bovine serum, 1% penicillin-streptomycin (Corning, 
45000-650), and 1% HEPES (Gibco, 15630080); cRPMI) before total 
CD8*° T-cell-positive enrichment on day 9 (Miltenyi, 130-045-201). CD8* 
T cells were stimulated with anti-CD3/CD28 beads (Dynabeads, 11141D) 
atal:1 ratio plus 30 U mI IL-2 (R&D Systems, 202ILO50CF) for 3 days. 
On day 12 of LARA, CD4* and CD8' were prepared for co-culture. HIV 
latently infected memory CD4* T cells were washed, counted and plated 
for latency reversal in CRPMI in the presence of ART (100 nM efavirenz, 
200 nM raltegravir and 5 iM saquinavir). Activated total CD8* T cells 
were removed from the anti-CD3/CD28 beads, washed and resuspended 
in CRPMI plus ART. CD4* and CD8*° T cells were co-cultured at a 1:1 or 
1:5 ratio at a final density of 1 x 10° cells per ml. CD4* monocultures 
were also maintained in parallel. Mono- and co-cultures were then 
left unstimulated, TCR-activated with 1 pg mI“ plate-bound OKT3 and 
1pg mI" soluble CD28 (Biolegend, 302933), or treated with 14 nMN-803 
(provided by NantKwest) or 500 ng mI IL-15 (R&D Systems, 247-ILB). 


Cells were collected after 72 h and analysed by flow cytometry and 
qPCR. 

Multicolour flow analysis of cell-surface and intracellular marker 
expression was performed with a BD FACSymphony flow cytometer. 
Between 200,000 and 600,000 events were acquired for each sample 
using the live-cell gate. The data were analysed with FlowJo (v.10). 

Antibodies used in in vitro suppression assays include: CD3 AF700 
(BD Biosciences, 557943, clone UCHT1), CD8 BUV737 (BD Biosciences, 
564629, clone SK1), HIV-1 core antigen-FITC (Beckman Coulter, 
6604665, clone KC57), CD4 BV421 (BD Horizon, 565997, clone SK3), 
CD45RA APC-eFluor780 (Invitrogen, 47045842, clone HI100), CD27 
BV650 (Biolegend, 302828, clone 0323), CCR7 Pe-Cy7 (BD Pharmingen, 
557648, clone 3D12) and Fixable Viability Dye eFluor 506 (Invitrogen 
eBioscience, 65-0866-18). 


Statistics and reproducibility 

Statistical analyses, including two-way Kruskal-Wallis tests, two-way 
Friedman tests, one-way ANOVA and Spearman r correlations, were 
performed using GraphPad Prism v.7.0 or v.8.0. Data are mean+s.e.m. 
as indicated. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Illumina sequencing reads for RNA-sequencing experiments were sub- 
mitted to the NCBI SRA repository (accession number SRP188630). 
RNA-sequencing datasets were submitted to the NCBI GEO repository 
(accession number GSE128415). env nucleotide sequences have been 
deposited in the GenBank (accession numbers MK922999-MK923550). 
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Extended Data Fig. 1| MT807R1 effectively depletes CD8' T cellsin 
peripheral blood, lymph node and rectum of macaques in addition to 
depleting NK cells from the blood at day 7. The percentage of CD8’ cellsinthe 
CD3* population 7 days after depletion was compared to the levels before 
depletion. a-c, Sample flow cytometry shows the absence of CD8f* cells as 
part of the CD3* T cell population after depletion in the peripheral blood (a), 
rectum (b) and lymph nodes (c) of macaques. Similar results were found across 
all CD8-depleted macaques (n = 28 biologically independent samples). d, The 
percentage of CD88" cells compared to pre-depletion baseline was calculated 
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in all CD8-depleted macaques (treated with or without N-803, n=28 macaques) 
across blood and tissue samples (no differences in CD8* T cell depletion were 
observed between groups 2 and 3 on day 7). Atwo-sided Friedman test was used 
tocalculate statistically significant changes compared to baseline across 
tissues. e, Depletion of NK cells in the peripheral blood was assessed 1 week 
after CD8 depletion alone (n=14 macaques) compared to baseline. Statistical 
significance was calculated using Wilcoxon signed-rank test. Dataare 

mean +s.e.m. 
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Extended Data Fig. 2| Phenotypic changes to CD4' T cells after intervention. 
Longitudinal flow cytometry analysis after treatment with only N-803 (green, 
n=7 macaques), only CD8 depletion (blue, n=14 macaques) and after CD8 
depletion and treatment with N-803 (red, n=14 macaques). a, CD4* T cell 
frequency. b-f, Percentage of naive (b), stem cell memory (SCM) (c), central 
memory (CM) (d), transitional memory (TM) (e) and effector memory (EM) (f) 
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CD4* T cells. g-I, Percentage of bulk CD4* T cells that express PD-1(g), CD25 (h), 
CD69 (i), HLA-DR (j), CCRS (k) and both CCRS and Ki-67 (I). m-v, CCR5 (m-q) 
and Ki-67 (r-v) expression in different subsets of CD4" T cells. Dataare 

mean +s.e.m. Two-sided Kruskal-Wallis tests were used to compare values 
after the intervention to the pre-intervention baseline and approximate P- 
value summaries are provided. 
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Extended Data Fig. 3 | SIV-associated genes and IL-15 subunit genes showa sequencing data were aligned to the MacaM v.7.8 assembly of the Indian rhesus 
transient change in expression after treatment with N-803 alone. RNA was macaque genome. Transcripts were analysed for alignment against acustom 
extracted from sorted peripheral bulk CD4* T cells (CD3*CD4*CD8° CD20— gene set with SIV host restriction factors, PPIA (capsid folding protein), SIV 
CD14 ) (left), bulk CD8* T cells (CD3*CD4 CD8*CD20 CD14 ) (middle) and NK receptors, SIV receptor agonists, NF-KB subunits (involved in mediating the 
cells (CD3:CD20° CD14 NKG2A’) (right) and libraries were prepared, transcription of long-terminal repeat regions), IL-1S receptor subunits and 


normalized, pooled and clustered on flow cells for sequencing. RNA- NFAT subunits. D, day; W, week. 
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Extended Data Fig. 4 | Quantification of levels of cell-associated SIVRNAin 
peripheral CD4*T cells before and after interventions. a—c, Changesin 
expression of SIVRNA in relation to the number of copies of CD4 after 
intervention with N-803 alone (n=7 macaques) (a), CD8 depletion alone 
(n=7 macaques) (b) or CD8 depletion combined with N-803 

(n=7 macaques) (c). Dataare mean+s.e.m. Two-sided Wilcoxon signed-rank 
tests were used to compare values after the intervention to the pre- 
intervention baseline. 
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Extended Data Fig. 5| Level of virus reactivation correlated withtheabsence depletion with N-803 treatment (d; n=14 macaques). Correlation coefficients 


of CD8* T cells. a, b, Correlation between CD8‘ T cell counts and viral load (SIV are calculated using the Spearman’s rank-order correlation (two-tailed, no 
RNA copies per ml of plasma) on day O, day 3, and weekly up to week 6 after adjustments). e-g, Longitudinal viral loads (top) and CD8* T cell counts 
interventions. a, CD8 depletion alone (n=103 samples from 14 macaques). (bottom) after N-803 treatment alone (e; n=7 macaques), CD8 depletion alone 
b, CD8 depletion combined with N-803 treatment (n=112 samples from (f; n=14 macaques) or CD8 depletion combined with N-803 treatment (g;n=14 
14 macaques).c, d, The area under the curve (AUC) and the average pre- macaques). Colour keys along the bottom indicate animal IDs. 
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Extended Data Fig. 6 | HIV DNA, HIV RNA and human T cell activation levels 
in HIV-infected, ART-suppressed BLT humanized mice treated with N-803, 
CD8 depletion alone or combined CD8 depletion with N-803.a, b, HIV- 
infected, ART-suppressed BLT humanized mice were treated with N-803 
(green, n=4 BLT humanized mice), CD8 depletion alone (blue, n=4), or treated 
with CD8 depletion with N-803 (red, n=4). After 7 days, total HIV DNA (a) and 
cell-associated HIV RNA (b) were extracted from mononuclear cells isolated 
from the spleen, human-derived thymus (huThy) and lymph node (LN; HIVRNA 


only). c-f, Percentages of HLA-DR*, CD38", CD25* or HLA-DR*CD38*' cells were 
measured in human CD4' (c, e) or CD8" (d, f) T cells isolated fromthe spleen 

(c, d) or human-derived thymus (e, f) of HIV-infected, ART-suppressed BLT 
humanized mice 7 days after treatment with N-803 (green, n=4), CD8 depletion 
alone (blue, n=4) or CD8 depletion combined with N-803 treatment (red, n=4). 
Treatment groups were compared using a two-tailed Student’s f-test (a) ora 
Kruskal-Wallis test with a false-discovery rate correction (b-f). Data are 
meant+s.e.m. 
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Extended Data Fig. 7| Phylogenetic trees of longitudinal SGA-derived Env (black). The horizontal bar below each tree indicates the genetic distance. 
amino acid sequences. Phylogenetic trees were generated for six macaques Sequence clusters that are supported by bootstraps greater than 80% are 

that received CD8 depletion with N-803 using Env sequences from the peak indicated by an asterisk. Env sequences that contain a stop codon are indicated 
viral load (red), pre-ART (blue) and reactivation (green) time points. The Env by an arrow. 


sequence of the SIV,,4.235 clone used for infection is included in each tree 
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Extended Data Fig. 8 | Longitudinal Env amino acid divergence fromthe reactivation (green) time points are shown. The animal ID and the three time 
input virus and relationship with viral load in the plasma. The number of points areindicated onthex axis. Statistical differences between time points 
aminoacid differences between the infecting viral clone SIV,,4¢239 and each SGA for each macaque were determined by performing multiple comparisons using 
amplicon was determined using Geneious. a, Violin plots show the frequency a Kruskal-Wallis test with Dunn’s correction. b, The average number of 
distribution of the number of amino acid differences between sequences at sequence differences for each macaque at the reactivation time pointis 
each time point ineach macaque. The solidlineindicatesthemediannumberof plotted ontheyaxis, andthe corresponding plasma viral loads are plotted on 
amino acid differences for each individual Env sequence; the dotted lines thexaxis ona log, scale. Correlation coefficients are calculated using the 


indicate the quartiles. Peak viral load (VL) (red), pre-ART (blue) and Spearman’s rank-order correlation (two-tailed, no adjustments). 
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Extended Data Fig. 9| Highlighter plots of longitudinal SGA-derived Env 


amino acid sequences. Highlighter plots were generated for six representative 


macaques that were depleted of CD8 and treated with N-803 using Env 
sequences from peak VL (red box), pre-ART (blue box) and reactivation (green 
box) time points. The Env sequence of the SIV nac239 clone used for infection is 


included as the master (reference) sequence in each plot. The position of N- 
linked glycosylation sites on the master sequence are indicated by pink circles. 
Eachtick represents an amino acid difference from the master sequence, as 
indicated in the key. Blue diamonds indicate the loss of an N-linked 
glycosylation site. 


Extended Data Table 1| Viral loads from macaque and humanized mouse studies 


Model: ART-treated, SlV-infected rhesus macaques (limit of detection = 3 copies of SIV RNA/mL of plasma) 
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Model: ART-treated, SHIV-infected rhesus macaques (limit of detection = 3 copies of SIV RNA/mL of plasma) 
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Model: ART-treated, HIV-infected bone marrow-liver-thymus humanized mice (limit of detection = 346 copies of SIV RNA/mL of plasma) 
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Longitudinal viral loads before and after interventions using the ART-treated, SIV.nac239-infected rhesus macaque model (limit of detection of three copies of SIV RNA per ml of plasma; corre- 
sponding to Fig. 2a—d) (top), the ART-treated, SHIVsr162p3-infected rhesus macaque model (limit of detection of 3 copies of SHIV RNA per ml of plasma; corresponding to Fig. 2g) (middle) and the 
ART-treated, HIV p.cs¢-infected BLT humanized mouse model (limit of detection of 346 copies of HIV RNA per ml of plasma; corresponding to Fig. 2i-k) (bottom). 
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All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 


- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


lllumina sequencing reads for RNA-Seq experiments were submitted to the NCBI SRA repository and are available at Accession #SRP188630. RNA-Seq datasets were 
submitted to the NCBI GEO repository and are available at accession number GSE128415. Env nucleotide sequences have been deposited into Genbank under the 
accession number MK922999-MK923550. 


=) 
fev) 
a 
e 
= 
o 
= 
o 
Nn 
o 
fev) 
= 
a 
= 
= 
io) 
O) 
(e) 
aN 
= 
a 
Nn 
S 
3} 
5} 
fev) 
5 
< 


Field-specific reporting 


Please select the one below that is the best fit for your research. If you are not sure, read the appropriate sections before making your selection. 


x Life sciences Behavioural & social sciences Ecological, evolutionary & environmental sciences 


For a reference copy of the document with all sections, see nature.com/documents/nr-reporting-summary-flat.pdf 


Life sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Sample size Based on our previous data on SIV-infected ART -treated rhesus macaques, with a sample size of at least 7, we would be able to detect a 
significant difference between pre- and post-CD8 depletion samples in the level of plasma RNA at the 0.05 significance level with a power of 
0.90. 


Data exclusions No data exclusion was applied to this study. 


Replication The use of non-human primates precludes our ability to replicate experiments. Sample sizes were chosen to maximize the likelihood of 
detecting statistical differences. 


Randomization Age, weight, sex, AO1 status, peak post-infection viral load, and time to suppression after ART were all controlled for when allocated animals 
into experimental groups. 


Blinding No blinding was used in this study. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
Antibodies ChIP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 


Animals and other organisms 


Human research participants 


Clinical data 


Antibodies 


Antibodies used Macaque studies: CCR5 APC (3A9), CCR7 FITC (150503), CD3 APC-Cy7 (SP34-2), CD4 BV650 (OKT4), CD8a BV711 (RPA-T8), CD8B 
PE-Cy5 (SIDISBEE), CCR5 APC (3A9), CCR7 FITC (150503), CD45RA Pe-Cy7 (5H9), CD62L PE (SK11), CD95 BV605 (DX2), PD-1 BV421 
(EH12.2H7), CD16 BV421 (3G8), CD20 PE-Cy5 (2H7), CD14 PE-Cy7 (M5E2), NKG2A (CD159) PE (2199), CXCRS PE-eFluor610 
(MUSUBEE), CD28 PE-Cy5.5 (CD28.2), and CD56 FITC (NCAM16.2). Human primary cell model: CD3 Alexa Fluor® 700 (UCHT1, BD 
Biosciences, #557943), CD8 BUV737 (SK1, BD Horizon™, #564629), HIV-1 core antigen-FITC (KC57, Coulter Clone, #6604665), 
CD4 BV421 (SK3, BD Horizon™, #565997), CD45RA APC- eFluor™780 (HI100, Invitrogen, #47045842), CD27 BV650 (0323, 
Biolegend, #302828), CCR7 Pe-Cy7 (3D12, BD Pharmingen™, #557648), Fixable Viability Dye eFluor™ 506 (Invitrogen eBioscience 
#65-0866-18). Humanized mice model: CD45 APC (clone HIT3a; BD Biosciences #555485), CD3 FITC (clone HIT3a; BD Biosciences 
#555339), CD4 APC-Cy7 (clone RPA-T4; BD Biosciences #560158), CD33 PE (clone P67.6; BD Biosciences #340679); CD19 PE-Cy7 
(clone SJ25C1; BD Biosciences #557835) and CD8 PerCP (clone SK1; BD Biosciences #347314), CD3 BV421 (clone UCTH1; BD 
Biosciences #562426), CD4 BV605 (clone RPA-T4; BD Biosciences #562658), CD45 FITC (clone 2D1; BD Biosciences #347463), 
HLA-DR PerCP (clone L243; BD Biosciences #347364), CD69 PE (clone FN50; BD Biosciences #555531), anti-CD38 PE-Cy7 (clone 
HB7; BD Biosciences #335790), CD25 APC (clone 2A3; BD Biosciences #340938), CD8 APC-Cy7 (clone SK1; BD Biosciences 
#557834), and AQUA (ThermoFisher #135957). 


Validation Antibodies used in macaque studies were validated in previous studies and the NIH Nonhuman Primate Reagent Resource. 
Independently validated in monochromatic titration on primary macaque cells. Antibodies used the humanized mice and human 
primary cell models were also independently validated in monochromatic titration experiments. 
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Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Indian-origin rhesus macaques, male and female bred at the Yerkes National Primate Research center. Animals were roughly four 
years of age at the start of the study. 


Wild animals NA 
Field-collected samples NA 
Ethics oversight All procedures are approved by the Emory University Institutional Animal Care and Use Committee (IACUC) and animal care 


facilities are accredited by the U.S. Department of Agriculture (USDA) and the Association for Assessment and Accreditation of 
Laboratory Animal Care (AAALAC) International. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 
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Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 
Sample preparation Described in methods pages 30-31. 
Instrument BD LSR Il 
Software FACS Diva V8.0.1., FlowJo V9.9.6. 


Cell population abundance Purity was high, as determined by flow cytometry of purified sample post-sort. 


Gating strategy As described in methods pages 30-31. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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Long-lasting, latently infected resting CD4* T cells are the greatest obstacle to 
obtaining a cure for HIV infection, as these cells can persist despite decades of 
treatment with antiretroviral therapy (ART). Estimates indicate that more than 

70 years of continuous, fully suppressive ART are needed to eliminate the HIV 
reservoir’. Alternatively, induction of HIV from its latent state could accelerate the 
decrease in the reservoir, thus reducing the time to eradication. Previous attempts to 
reactivate latent HIV in preclinical animal models and in clinical trials have measured 
HIV induction in the peripheral blood with minimal focus on tissue reservoirs and 
have had limited effect” °. Here we show that activation of the non-canonical NF-KB 
signalling pathway by AZD5582 results in the induction of HIV and SIV RNA expression 
inthe blood and tissues of ART-suppressed bone-marrow-liver-thymus (BLT) 
humanized mice and rhesus macaques infected with HIV and SIV, respectively. 
Analysis of resting CD4* T cells from tissues after AZD5582 treatment revealed 
increased SIV RNA expression in the lymph nodes of macaques and robust induction 
of HIV in almost all tissues analysed in humanized mice, including the lymph nodes, 
thymus, bone marrow, liver and lung. This promising approach to latency reversal—in 
combination with appropriate tools for systemic clearance of persistent HIV 
infection—greatly increases opportunities for HIV eradication. 


Latently infected cells carrying an integrated replication-competent 
provirus that contribute to viral rebound after the interruption of 
ART (termed the HIV reservoir) are not detected and eliminated by 
the immune system or current therapeutics. Therefore, the HIV res- 
ervoir has been targeted by approaches to reverse latency and induce 
viral antigen production (that is, ‘HIV reactivation’)? °, which renders 
infected cells susceptible to virus-induced cell death or clearance by 
the immune system. Previous approaches to HIV reactivation have 


been modestly effective and have not demonstrated reactivation of 
HIV in resting CD4* T cells in tissues” *. 


HIV induction in vitro by SMAC mimetics 

The lack of specificity of molecules that activate the NF-KB pathway 
as latency-reversal agents (LRAs) often leads to toxicities that pre- 
vent clinical implementation’. We tested the induction of HIV and 
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Fig. 1| Efficient in vitro AZD5582 target engagement and induction of HIV 
transcription. a, Total CD4* T cells were treated with a broad range of 
concentrations (10 pM-1 pM) of AZD5582 overnight, and cell lysates were 
analysed by immunoblot, probing for clAP1and p100/p52 as indicated (top; 
representative of 10 experiments). Immunoblot analysis of isolated total CD4* 
Tcelllysates after treatment with 100 nM AZD5582, examining components of 
the cNF-kB and ncNF-KB pathways over a48-htime course after treatment 
(middle and bottom; representative of three and four experiments, 
respectively). b, DMSO-normalized reporter signal induced by a dose titration 
of a panel of mono- and bivalent SMAC mimetics in aJurkat luciferase reporter 
model of HIV-1 latency with 48 h exposure. Symbols represent technical 
replicates froma single run and are representative of three independent 
experiments. Lines represent a four-parameter logistic-regression model fit. 
c, Infectious units per million resting CD4* T cells induced by DMSO or 100 nM 
AZD5582 were determined ina limiting dilution quantitative viral outgrowth 
assay. PHA, phytohaemagglutinin. d, Volcano plots summarizing mean up- and 
downregulated genes at 2, 6 and 24 hafter treatment with ingenol B or 
AZD5582 compared with treatment with DMSO alone. The mean log,- 
transformed fold change is shown on thex axis and log, -adjusted Pvalues 
(two-sided Wald test) are shown onthe y axis. Dashed lines represent 
thresholds of log,-transformed fold change of land adjusted P< 0.05. The data 
shown represent the mean fold change across four donors and one experiment. 
For gel source data, see Supplementary Fig. 1. 


SIV transcription in latently infected cells by the non-canonical (nc) 
NF-KB pathway. This pathway activates a limited number of cellular 
genes and a more-gradual but persistent activation of NF-KB-driven 
transcription than the canonical (c)NF-KB pathway". Mimetics of the 
second mitochondrial-derived activator of caspases (SMAC) activate 
the ncNF-KB pathway by inhibiting the cellular inhibitor of apoptosis 
protein 1 (cIAP1) and clAP2. clAP1 continually represses the ncNF-KB 
pathway by constitutively degrading the NF-KB-inducing kinase, thus 
preventing processing of p100 into p52”; this repression can be relieved 
in CD4* T cells by in vitro treatment with the SMAC mimetic AZD5582 
(Fig. laand Extended Data Fig. 1). Compared with other SMAC mimet- 
ics, AZD5582 had a superior capacity to reverse HIV latency in vitro” 
(Fig. 1b). AZD5582 also induced replication-competent HIV expres- 
sionin resting CD4°T cells from ART-suppressed HIV-infected donors 
(Fig. 1c). AZD5582 induced five- to tenfold fewer genes than the protein 
kinase C agonist ingenol B (Fig. 1d), a cNF-KB pathway inducer and 
activator of several transcription factors. By specifically targeting the 


ncNF-kB signalling pathway, AZD5582 has limited pleotropic effects, 
which may translate to fewer off-target effects”. 


Latency reversal in BLT humanized mice 


BLT mice were infected with HIV-1jp.cs¢ (Supplementary Table 1) and sup- 
pressed with ART> “8 (Fig. 2a, b). Mice then received a single intraperi- 
toneal injection of 3 mg kg+AZD5582 or vehicle. No changes in plasma 
HIV RNA levels were detected in vehicle-control-treated mice at 24 or 
48 hnor in AZD5582-treated mice 24 h after AZD5582 administration 
(Fig. 2c). However, 48 h after AZD5582 treatment increased HIV RNA 
expression was detected in the plasma of 3 out of 6 (50%) and 3 out 
of 4 (75%) mice in two independent experiments (Fig. 2c). These data 
demonstrate that a single dose of AZD5582 can induce HIV production, 
resulting in significant viraemia (up to 1,574 HIV RNA copies per ml 
plasma) in ART-treated BLT mice (Supplementary Table 2). 

The hallmark of HIV persistence in humans is the presence of inducible 
HIV in resting CD4* T cells. Therefore, resting cells from primary (bone 
marrow and thymic organoid), secondary (lymph node and spleen) and 
effector (liver and lung) immune tissues were isolated from HIV-infected 
ART-suppressed BLT mice 48 h after treatment with vehicle control or 
AZD5582”°?0, The levels of HIVRNA in resting CD4* T cells from AZD5582- 
treated mice were 11-fold (bone marrow, P= 0.0201), 21-fold (thymic 
organoid, P=0.0038), 12-fold (lymphnode, P=0.0004), 1.4-fold (spleen, 
P=0.0426),24-fold (liver, P=0.0145) and 3.2-fold (lung, P=0.0029) higher 
than controls (Fig. 2d). These results were confirmed in a second inde- 
pendent experiment (Extended Data Fig. 2). No notable differences in 
cell-associated HIV DNA were noted between mice treated with vehicle 
control or AZD5582 (Supplementary Table 3). These results demonstrate 
that AZD5582 induces systemic HIV RNA production in resting CD4* 
T cells, indicative of latency reversal in this important cellular source of 
persistent HIV infection. We also isolated cells from the peripheral blood, 
female reproductive tract and brain of HIV-infected ART-suppressed BLT 
mice 48 h after treatment with AZD5582 or vehicle control. As too few 
CD4" T cells were available for cell sorting from these compartments, 
RNA was extracted from total cells and analysed for the presence of HIV 
RNA in each tissue. The levels of HIV RNA were significantly higher in 
the female reproductive tract (3.4-fold, P= 0.0152) and brain (8.7-fold, 
P=0.0147) of AZD5582-treated mice compared with vehicle controls, but 
not in the blood (P= 0.3095) (Fig. 2e). Together, these results show that 
AZD5582 treatment induces systemic HIV RNA production in BLT mice. 


Pharmacodynamics and safety in BLT mice 


Target engagement after treatment with AZD5582 was confirmed ex 
vivo by the degradation of clAP1 (proximal) and p100 (distal), targets 
of SMAC in the ncNF-kB pathway (Extended Data Fig. 3a). In vivo tar- 
get engagement was demonstrated in resting CD4* T cells isolated 
from thymus, spleen, lymph nodes, liver, lung and bone marrow of 
BLT mice treated with a single dose of 3 mg kg 1 AZD5582 or vehicle 
control (Extended Data Fig. 3b). These results were confirmed by 
immunohistochemical analysis of the thymic organoid of HIV-infected 
ART-suppressed BLT mice. This analysis showed a marked reduction 
in clAP1 expression in the thymic organoid of AZD5582-treated mice 
(Extended Data Fig. 3c). 

To study off-target or immune-mediated toxicities of AZD5582, we 
measured serum chemistry, T cell activation and a panel of plasma 
cytokines after in vivo treatment of immunocompetent BALB/c mice. 
AZD5582 administration resulted in mild and transient increases in alanine 
aminotransferase and aspartate aminotransferase that resolved afew days 
after treatment. No other changesin serum chemistries were noted (Sup- 
plementary Table 4). In addition, no differences were noted inthe levels of 
activated (CD38*HLA-DR’*) CD4* or CD8*' T cells in BLT mice treated with 
AZD5582 or vehicle control (Supplementary Table 5) or in plasma levels of 
41 humancytokines and chemokines that serve as indicators of systemic 
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Fig. 2|AZD5582 induces HIV RNA expression in resting CD4* T cells from 
tissues of HIV-infected ART-suppressed BLT mice. a, BLT mice were infected 
with HIV-Lp.csp, After 10 weeks of ART treatment, mice received vehicle control 
or AZD5582. FTC, emtricitabine; RAL, raltegravir; TDF, tenofovir disoproxil 
fumarate. b, HIVRNA copies per ml‘ of plasma of HIV-infected ART-treated BLT 
mice before treatment with vehicle control (left; blue lines) or AZD5582 (right; 
red lines). Two independent experiments were performed (left, n=6 mice per 
group; right, n=4 mice per group). Grey shading, period of ART administration. 
c, Plasma HIV RNA levels in HIV-infected ART-suppressed mice from btreated 
with vehicle control or AZD5582. d, HIV viral RNA (vRNA) levels in resting CD4* 
Tcells isolated from the bone marrow (BM), thymic organoid (Org), lymph 
nodes (LN), spleen, liver and lung of control or AZD5582-treated mice (cells 
pooled fromn=6 mice per group for each tissue) were analysed in triplicate. 
Data are mean +s.e.m. Statistical significance was determined using a two- 
sided Student’s t-test. e, Cell-associated HIV RNA copies in the blood (n=6), 
female reproductive tract (n= 6) and brain (n=3). PBMCs, peripheral-blood 
mononuclear cells. Statistical significance was determined using atwo-sided 
Mann-Whitney test (peripheral-blood mononuclear cells and female 
reproductive tract) or Student’s ¢-test (brain). Colours indicate samples from 
the same mice. Data are mean+s.e.m. 


activation and inflammation”! (Supplementary Table 6). Together, these 
results demonstrate that AZD5582 does not cause generalized toxicity 
or activation of the immune system inthe BLT model. 


Latency reversal in rhesus macaques 

We next evaluated the latency-reversal activity of AZD5582 in 21 
MamuB*08 and MamuB*17 rhesus macaques infected with SIV,,4.239 
and treated with a potent ART regimen comprising tenofovir diso- 
proxil fumarate, emtricitabine and dolutegravir initiated 8 weeks after 
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infection (Fig. 3a and Supplementary Table 7)””*. Suppression of SIV 
viraemia below 60 copies per ml (standard assay limit of detection) 
was achieved in all macaques in 2-20 weeks and ART was continued for 
55-67 weeks before further treatment (Fig. 3b). On the basis of phar- 
macokinetic and pharmacodynamic data from uninfected macaques 
(Extended Data Fig. 4a) as well as protocols for SMAC mimetics used in 
oncology, intravenous infusions of 0.1 mg kg’ AZD5582 were admin- 
istered weekly to 12 SIV-infected ART-suppressed rhesus macaques 
for 3 or 10 weeks (Fig. 3a). Nine SIV-infected ART-suppressed rhesus 
macaques served as controls (Fig. 3a). Plasma concentrations of 
AZD5582 measured after the first, third, sixth and tenth dose showed 
that drug exposures in SIV-infected ART-suppressed macaques were 
consistent across the treatment period and comparable to those 
observed in uninfected rhesus macaques (Extended Data Fig. 4b). 

Latency reversal, defined as on-ART viraemia increasing from less 
than 60 copies per ml of plasma to more than 60 copies per ml of 
plasma after AZD5582 treatment, was observed as early as 96 h after 
the first dose and reached levels as high as 1,390 copies per ml in SIV- 
infected rhesus macaques (Fig. 3c, d).On-ART viraemia >60 copies per 
ml of plasma was observed in 5 out of 12 rhesus macaques (42%), cor- 
responding to 5 out of 9 rhesus macaques (55%) that received 10 doses 
of AZD5582 (Fig. 3c, d). Multiple instances of sustained viraemia >60 
copies per ml between AZD5582 doses were observed. Out of 140 viral 
load measurements performed on the 5 macaques that exhibited on- 
ART viraemia during AZD5582 treatment, 64 were >60 copies per ml 
(46%); inthe macaque with the greatest frequency of reactivation, this 
proportion was 15 out of 28 (53%). Longitudinal examination of plasma 
virus by single-genome sequencing analysis of the SIV,,4-239 env gene 
in all rhesus macaques that experienced AZD5582-induced on-ART 
viraemia was performed at four selected time points: 2 weeks after 
infection (near peak viraemia), 8 weeks after infection (immediately 
before ART initiation), and at 2 time points separated by 26-42 days 
during AZD5582 treatment. Phylogenetic analyses showed several 
patterns of virus reactivation (Extended Data Fig. 5). In two rhesus 
macaques (RDI16 and RKn16), most of the reactivated virus sequences 
were phylogenetically closer to sequences at eight weeks after infection 
rather than peak viraemia and were unique, indicating that the variants 
produced during AZD5582 treatment originated from multiple cells 
that were seeded at the time of ART initiation. In two other rhesus 
macaques (RKI16 and RDm16), a large fraction of the viruses produced 
during AZD5582 treatment showed identical sequences, suggesting 
that latency reversal occurred froma single cell or aclonally expanded 
population of infected cells. These clones clustered with both peak 
and pre-ART time points and were accompanied by additional unique 
sequences. In one rhesus macaque (RLy15), a single virus sequence was 
amplified at each time point during AZD5582 treatment and these were 
both phylogenetically similar to sequences found before ART treat- 
ment. Taken together, these results indicate that AZD5582 induced 
virus reactivation from a diverse population of cells, some of which 
may be clonally expanded”>”*, 

We quantified cell-associated SIV RNA and SIV DNA in resting CD4* 
T cells sorted from SIV-infected ART-suppressed rhesus macaques 
treated or not with AZD5582. Cell-associated SIV RNA levels in resting 
CD4' T cells isolated from lymph nodes were significantly higher in 
macaques who received ten doses of AZD5582 compared with controls 
(P=0.0148) (Fig. 3e). A similar trend was observed in resting CD4* T cells 
isolated from the spleens of a subgroup of six macaques that were 
euthanized. Levels of cell-associated SIV DNA in resting CD4* T cells 
were similar in each compartment across groups (Fig. 3e). To further 
understand whether latency reversal induced by AZD5582 resultedina 
perturbation of the overall level of infected CD4*T cells, we performed 
longitudinal measurements of cell-associated SIV DNA in total (rather 
than resting) CD4" T cells isolated from lymph nodes and blood as 
well as quantitative viral outgrowth assays using CD4* T cells from 
lymphnodes and spleen at the end of the treatment period (Extended 
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Fig. 3 |AZD5582 induces SIV RNA expression in the plasma and lymph nodes 
of ART-suppressed SIV-infected rhesus macaques. a, Experimental design 
during AZD5582 treatment phase. Three rhesus macaques received three 
doses of AZD5582 and were euthanized 48 h after the last dose. Nine rhesus 
macaques received 10 doses of AZD5582 and 3 were euthanized 48 h after the 
last dose. The first dose of AZD5582 was administered after 55-67 weeks of ART 
(3-dose group, 55 weeks; 10-dose group, 55-67 weeks). Four control rhesus 
macaques received a weekly placebo infusion; 2 were euthanized 48 hafter 3 
infusions and 2 were euthanized 48 h after 10 infusions. Five control macaques 
received ART only. b, Plasma SIV RNA levels in the 21 SIV-infected rhesus 
macaques before treatment with AZD5582 (left, n =12) and equivalent time 
period for controls (right, n= 9).b-d, Grey shading represents the period of 
ART administration. c, Plasma SIV RNA levels in ART-suppressed SIV-infected 
rhesus macaques during AZD5582 treatment (left, n=12) and the equivalent 


Data Fig. 6a—c). Despite the high level of virus reactivation induced by 
AZD5582, these experiments did not reveal a consistent reduction inthe 
total or replication-competent SIV reservoir compared with controls. 


Pharmacodynamics in rhesus macaques 


Pharmacological target engagement of the ncNF-KB pathway was 
confirmed by western blot analysis of the degradation of p100 to p52 


time period for controls (right, n= 9). d, Individual representation of plasma 
SIVRNA levels in the five rhesus macaques that experienced on-ART viraemia 
during AZD5582 treatment. e, Cell-associated SIV RNA (left) and SIV DNA 
(right) levels in resting CD4* T cells isolated from lymph nodes, peripheral 
blood (PB), spleen and bone marrow of AZD5582-treated (red) and control 
(blue) ART-suppressed SIV-infected rhesus macaques. Resting CD4*T cells 
were analysed from AZD5582-treated rhesus macaques 48 h after receiving 3 
doses (D3; lymph nodes, peripheral blood and spleen, n=3) or 10 doses (D10; 
lymphnodes, n=7; peripheral blood, n=9; spleen, n=3; bone marrow, n= 6) of 
AZD5582. Resting CD4*T cells were analysed from control rhesus macaques 

(C; lymph nodes and peripheral blood, n=9; spleen, n=4; bone marrow, n=5) at 
equivalent time points. Open symbols indicate AZD5582-treated rhesus 
macaques with on-ART viraemia. Statistical significance was determined witha 
two-sided Mann-Whitney U-test. Horizontal lines represent the median. 


in lymph-node mononuclear cells after in vivo exposure to AZD5582 
(Extended Data Fig. 4c) and in splenocytes treated ex vivo with AZD5582 
(Extended Data Fig. 4d-g). Transcriptomic profiling of CD4* T cells 
from the peripheral blood and lymph nodes isolated from AZD5582- 
treated rhesus macaques showed a distinct effect of AZD5582 on gene 
expression based on principal component (Fig. 4a) and DAVID pathway 
(Extended Data Fig. 7a) analyses. Enrichment of NF-KB targets after 
AZD5582 treatment was demonstrated by gene-set enrichment analysis 
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(GSEA) (Fig. 4b; a heat map of differentially expressed gene targets of 
NF-KB is shown in Extended Data Fig. 7b). The hallmark ncNF-KB sig- 
nalling genes NFKB2 and RELB were significantly upregulated in CD4* 
T cells from the peripheral blood and lymph nodes after AZD5582 treat- 
ment, whereas cNF-kB signalling molecules were mostly unaffected 
except for the inhibitor NFKBIA (Fig. 4c). The baculoviral IAP repeat- 
containing 3 (B/IRC3) gene, which encodes clAP2 and regulates both 
the cNF-KB and ncNF-KB pathways, was also significantly upregulated 
in CD4* T cells from the peripheral blood and lymph nodes (Fig. 4c). 
Notably, activation of ncNF-kB signalling genes was evident in rhesus 
macaques with and without on-ART viraemia of more than 60 copies per 
ml (Fig. 4c and Extended Data Fig. 7c) and GSEA showed similar changes 
in overall gene expression when AZD5582-treated rhesus macaques 
were grouped according to the presence or absence of on-ART viraemia 
of more than 60 copies per ml (Supplementary Table 8), suggesting 
that the lack of a virological response measured by the standard viral 
load assay was not due to compromised target engagement. 

We next used an ultrasensitive assay to measure SIV RNA in plasma, 
comparing 2-3 baseline time points during ART in the 4 weeks before 
AZD5582 treatment (when plasma SIV RNA was less than 60 copies per 
mlin all rhesus macaques using the standard viral load assay) and 3-4 
time points during AZD5582 treatment (Extended Data Fig. 8a). We 
found significantly higher plasma SIV RNA values during the period 
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of AZD5582 treatment compared with pre-treatment (Extended Data 
Fig. 8b, P= 0.008), with 8 out of 12 rhesus macaques demonstrating 
>2 SIVRNA measurements above the median of their baseline values. 
This group of eight rhesus macaques with evidence of latency rever- 
sal induced by AZD5582 could be segregated from the four rhesus 
macaques with unchanged viral loads based on two parameters: higher 
levels of plasma SIV RNA immediately before ART initiation and SIV 
DNA in peripheral CD4* T cells before AZD5582 treatment (P= 0.004 
for each) (Extended Data Fig. 8c). 


Safety and immune effects in rhesus macaques 


The potential for AZD5582 toxicity in SIV-infected ART-suppressed 
rhesus macaques was examined and transient increases in liver enzymes 
(aspartate aminotransferase and y-glutamyltransferase) were observed 
(Extended Data Fig. 9a). Total white-blood cell counts were decreased 
inall rhesus macaques 48 h after the first dose of AZD5582 but returned 
to normal levels when measured before the second dose and over 
time (Extended Data Fig. 9b). After the seventh and eighth doses, one 
macaque experienced a reaction characterized by fever, emesis, fatigue 
and lack of appetite, and had elevated liver enzyme and creatinine levels 
and bandaemiaas shown by laboratory examination. All abnormalities 
resolved within a two-week period; however, this rhesus macaque did 


not receive further doses. Notably, weights did not significantly fluctu- 
ate over the course of the treatment period (Extended Data Fig. 9c). In 
summary, 97 doses of AZD5582 were administered to 12 SIV-infected 
ART-suppressed rhesus macaques, of which 95 were well-tolerated and 
2 resulted ina mild adverse reaction. 

Markers of T cell activation (both CD4* and CD8°) were assessed 
in AZD5582-treated rhesus macaques and controls, and expression 
of CCR5S, HLA-DR and PD-1 on CD4* T cells from the blood and lymph 
nodes was similar before and after treatment with AZD5582 (Fig. 4d, e). 
Levels of intracellular Ki-67 expression were increased in CD4* T cells in 
the blood and CD8' T cells in the blood and lymphnodes after AZD5582 
treatment (Fig. 4d, e). CD8* T cell expression of HLA-DR was higher in 
the blood after exposure to AZD5582 (Fig. 4e). These results indicate 
that AZD5582 does not induce global CD4" T cell activation in rhesus 
macaques but the combined pharmacological and virological effects 
may have a stimulatory effect on CD8’ T cells. Longitudinal analyses 
of CD4* T cell counts and frequencies, CD4*T cell viability, CD4*T cell 
subset frequencies and their Ki-67 expression are shown in Extended 
Data Fig. 10a—d. The increased Ki-67 observed within CD4* T cells after 
AZD5582 treatment may provide a note of cautionas the proliferation 
of latently infected memory CD4’ T cells is hypothesized to contribute 
to the maintenance of the viral reservoir over time**”’; however, we did 
not find an increase in infected cells after AZD5582 treatment (Fig. 3e 
and Extended Data Fig. 6). 

We next determined whether AZD5582 would impair SIV-specific T cell 
responses in SIV-infected ART-suppressed rhesus macaques, a poten- 
tial adverse outcome of treatment that has been suggested for other 
LRAs’’. However, the frequency of SIV Gag- or Env-specific CD8* T cells 
measured by IFNy ELISPOT did not decrease after AZD5582 treatment 
(Extended Data Fig. 10e), and CD8° T cell polyfunctionality and prolifera- 
tive responses were largely unaffected by AZD5582 treatment ex vivo, 
with the exception of IL-2*IFNy*TNF* triple-positive cells, which were 
slightly reduced (P= 0.0476) (Extended Data Fig. 10f, g). Furthermore, 
longitudinal assessment of plasma levels of inflammatory cytokines and 
chemokines did not reveal any marked changes induced by AZD5582 
(Extended Data Fig. 10h). Overall, our workin the rhesus macaque model 
indicates that AZD5582 treatment is safe in most macaques and can 
induce appreciable increases in plasma SIV RNA as wellas SIV RNA expres- 
sionin resting CD4*T cells from the lymph nodes during ART. 


Discussion 


Eradication of HIV infection after prolonged viral suppression is the 
focus of intense research and latency reversal has been a cornerstone 
of this effort. LRAs have been widely recognized as important (but 
previously mostly theoretical) tools to induce HIV expression in 
resting CD4* T cells in humans. Future clinical applications of HIV 
cure strategies must be relevant to the majority of people living with 
HIV for whom the treatments of the Berlin and London patients”? 
(whose non-HIV life-threatening haematological malignancies war- 
ranted aggressive, toxic therapies) pose an unacceptable level of risk. 
Therefore, LRAs must be identified that are highly effective but have 
minimal side effects. Here, we used two different but highly comple- 
mentary animal models” to show that treatment with AZD5582 had 
minimal and transient side effects but resulted in significant increases 
in HIV and SIV RNA levels both in the plasma and in resting CD4* T cells 
isolated from all analysed tissues from BLT mice and from the lymph 
nodes of macaques. Our results provide in vivo evidence of systemic 
HIV and SIV latency reversal from resting CD4* T cells. The concord- 
ance between the results obtained in two fundamentally different 
animal models highlights the robust and reproducible nature of the 
effect of AZD5582 on HIV and SIV reservoirs. The fact that there is 
little toxicity associated with the use of AZD5582 strongly suggests 
that activators of the ncNF-KB pathway may be well-suited for HIV 
eradication approaches in humans. 
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Methods 


Experimental design 

The purpose of this study was to determine the efficacy of the SMAC- 
mimetic AZD5582 as an HIV and SIV LRA in vivo. To this end, two animal 
models of HIV-Linfection were used: the HIV-infected BLT humanized 
mouse model and the SIV-infected rhesus macaque (Macaca mulatta) 
nonhuman primate (NHP) model. In each model, animals were infected 
with HIV-1 or SIV and viraemia was durably suppressed with ART. 
AZD5582 was administered to infected ART-suppressed animals that 
were then assessed for changes associated with the reactivation of the 
viral reservoir. Mice were maintained under specific-pathogen-free 
conditions by the Division of Comparative Medicine at the University 
of North Carolina, Chapel Hill. Mouse experiments were conducted in 
accordance with NIH guidelines for the housing and care of laboratory 
animals and in accordance with protocols reviewed and approved by the 
Institutional Animal Care and Use Committee (IACUC) at the University 
of North Carolina, Chapel Hill. Healthy rhesus macaques for pharma- 
cokinetic studies were housed at GlaxoSmithKline and all procedures 
were conducted in accordance with the GlaxoSmithKline policy onthe 
care, welfare and treatment of laboratory animals and were reviewed 
by the IACUC at GlaxoSmithKline. Rhesus macaques infected with 
SIV were housed at the Yerkes National Primate Research Center and 
treated in accordance with Emory University and Yerkes National Pri- 
mate Research Center IACUC regulations (PROTO201800308). Animal- 
care facilities are accredited by the US Department of Agriculture and 
the Association for Assessment and Accreditation of Laboratory Animal 
Care International. 


Preparation of Jurkat HIV-luciferase cell clones 

Cell clones Jurkat-C16 and Jurkat-I15 were prepared by infecting Jur- 
kat cells (Jurkat clone E6-1 cells, American Type Culture Collection 
TIB-152, authenticated by morphological identification and virus- 
susceptibility profiles, tested for mycoplasma by the supplier) with 
a full-length, infectious HIV-1,,,.,-based virus engineered to express 
a luciferase reporter in place of the HIV-1 nef gene (NLCH-Luci). The 
Jurkat-N6 cell clone was generated using the same virus as described 
above with an additional mouse heat-stable antigen (HSA) reporter 
located just downstream of the luciferase open-reading frame and 
separated by a T2A element (NLCH-Luci-HSA). NLCH, provided by 
R. Swanstrom, is the parent molecular infectious clone used to make 
the Jurkat clones and is a modification of HIV-1,, ,., (GenBank U26942) 
in which flanking sequences were removed. All viruses were derived 
by transfection of human embryonic kidney 293 cells (HEK 293T, 
European Collection of Authenticated Cell Cultures, authenticated 
by morphological identification, tested for mycoplasma by the sup- 
plier) with 1 pg of the HIV-1,, ,.,-derived infectious molecular plasmid 
DNA using the FUGENEHD Transfection reagent (Promega) according 
tothe manufacturer’s recommendations. Supernatants were collected 
48 h after transfection, passed through a 0.2-ym filter, and used to 
infect wild-type Jurkat cells. After infection, cells expressing high levels 
of HIV-encoded mouse HSA were removed using biotin-labelled rat 
anti-mouse CD24 antibody (clone M1/69, BD Biosciences) that was 
adsorbed to streptavidin-labelled magnetic Dynabeads M-280 (Life 
Technologies) according to the manufacturer’s recommendations. 
Negatively selected HIV-infected Jurkat cells were then limit-diluted 
to 0.5 cells per well in 96-well plates, and individual cell clones were 
expanded for 2-4 weeks in culture in the presence of 500 nM efavirenz. 
Clones were profiled for baseline reporter level and responsiveness to 
benchmark LRAs, with C16, 115 and N6 representing the most quiescent 
but inducible clones that were obtained. 


Cell culture and Jurkat HIV-luciferase assay 
Jurkat HIV-luciferase clones were maintained in Roswell Park Memorial 
Institute (RPMI) 1640 medium (Gibco, Life Technologies) containing 


10% (v/v) fetal bovine serum (FBS; SAFC, Sigma-Aldrich), 25 U mI” peni- 
cillin and 25 U mI streptomycin (Gibco, Life Technologies), and were 
split 1:4 every 3-4 days to maintain a cell density of around 0.3-1 million 
cells per ml. The Jurkat clones were maintained with the addition of 
500 nMefavirenz to the medium. Three Jurkat cell clones (C16, 115 and 
N6), each of which contained one or two integrated HIV proviruses that 
expressed the luciferase reporter gene, were added at equal amounts 
for a total of 5,000 cells per well to 384-well plates containing com- 
pound titrations. Dose-response testing was performed on compounds 
dissolved in dimethyl sulfoxide (DMSO; Fisher Scientific) dispensed 
in duplicate serial threefold, 14-point titrations using a D300e Digital 
Droplet Dispenser (Hewlett-Packard) to give final assay concentrations 
ranging from 10 pM to 2.1 pM in 50 pl of medium with a final concentra- 
tion of 0.5% DMSO (v/v). Cells and compound were incubated at 37 °C 
for 48 h, unless otherwise indicated, followed by the addition of 20 ul 
of Steady-Glo Luciferase (Promega). Luminescence resulting from 
the induction of virally expressed luciferase was measured using an 
EnVision 2102 Multilabel Plate Reader (Perkin Elmer). Dose-response 
relationships were analysed with GraphPad Prism (v.6) using a four- 
parameter logistic regression model to calculate the concentration of 
compound that provides the half-maximal response and the maximal 
percentage activation compared to the vehicle control. 


Immunoblot analyses 

For the immunoblot assays, 10 pg of cell lysate was loaded per well into 
4-20% Tris-Glycine SDS-PAGE gels. Proteins from the SDS-PAGE gels 
were transferred to Turbo Midi PVDF Transfer Packs (BioRad) using the 
‘Mixed MW’ protocol for one Midi Format Gel (constant 2.5 Aup to 25 V, 
for 7 min) of the Trans-Blot Turbo Transfer System (BioRad) with pre- 
made Trans-Blots according to the manufacturer’s instructions. After 
transfer, PVDF membranes were blocked in 5% bovine serum albumin 
(BSA) in 1x Tris-buffered saline (TBS) (BioRad) with 0.1% Tween-20 for 
Ihat room temperature with gentle rocking. Primary antibodies were 
added and incubated overnight at 4 °C (anti-cIAP1, 1:1,000 (Abcam); 
anti-p100/p52, 1:1,000 (Cell Signaling Technology); anti-IkBa, 1:1,000 
(Cell Signaling Technology); anti-cIAP2, 1:1,000 (Abcam); and anti- 
actin-HRP conjugate, 1:30,000 (Abcam). After staining with primary 
antibodies, the membrane was washed three times with 1x TBS and 
0.1% Tween-20, 10 min each wash. After washing, the membrane was 
incubated in 5% BSA in 1x TBS and 0.1% Tween-20 with the appropriate 
secondary antibody for 2 h at room temperature. After staining with 
secondary antibodies, the membrane was washed twice for 10 min 
with 1x TBS and 0.1% Tween-20 followed by a 10-min wash with 1x TBS. 
The membrane was then patted dry with filter paper and animage was 
captured of the undeveloped membrane on a ChemiDoc MP Imaging 
System using Image Laboratory software (v.6.0.1, BioRad). Sufficient 
ECL reagent (GE Healthcare) was used to cover the membrane and a 
series of images was taken with increasing exposure times until the 
luminescence from the developed membrane saturated the image. The 
developed membrane was then washed 3 times with 1x TBS for 5 minto 
remove the residual ECL reagent and then stored at 4 °C in sufficient 
1x TBS to submerse the entire membrane. Densitometry of images of 
the developed membrane was then carried out using Image Laboratory 
software (v.6.0.1, BioRad). Some membranes were stripped for 1 min 
with One Minute Plus Western Blot Stripping Buffer (GM Biosciences) 
and then washed 3 times for 10 min with 1x TBS. The stripped mem- 
branes were then blocked in 5% BSA in 1x TBS and 0.1% Tween-20 for 
1hand re-probed overnight with anew primary antibody. To normalize 
samples for loading a1:20,000 dilution of B-actin (Abcam) was runon 
the stripped membranes. 


Target gene RT-qPCR 

We treated 2 million normal donor CD4* T cells witha range of concen- 
trations of AZD5582. Total RNA was isolated using the RNEasy Mini kit 
(Qiagen) according to the manufacturer's instructions. The following 


TaqMan primer probe sets were sourced from Applied Biosystems: 
Hs00985031 _g1 (BIRC3), HSO0174517_m1 (NFKB2) and HsO2800695 
(HPRT1). TaqMan-based quantitative PCR with reverse transcription 
(RT-qPCR; Fast Virus 1-Step Master Mix, Applied Biosystems) was used 
to amplify host genes of interest and acquire the signal ona QuantStu- 
dio 3 Real-Time PCR thermocycler (ThermoFisher). Gene expression 
was normalized to HPRT1 and the comparative threshold cycle (C,) 
method (AAC,) was used for relative quantification of gene expression. 
Relative quantification was analysed by QuantStudio 3 Real-Time PCR 
System software (v.1.4.3, ThermoFisher). 


HIV quantitative viral outgrowth 

All human peripheral-blood mononuclear cell (PBMC) samples were 
obtained under a specimen procurement protocol reviewed and 
approved by the University of North Carolina Biomedical Institutional 
Review Board and the McGill University Health Centre Ethical Review 
Board. Informed consent was obtained from all participants. Human 
PBMCs for quantitative viral outgrowth were obtained using continu- 
ous flow leukapheresis. Resting CD4* T cells were isolated and virus 
outgrowth assays were performed as previously described” with 
some modifications. In brief, 20-50 x 10° highly purified resting CD4* 
T cells were stimulated with phytohaemagglutinin, IL-2 (60 U mI”) and 
irradiated PBMCs froma seronegative donor, or with100 nM AZD5582, 
335 nM vorinostat or 0.003% DMSO (vehicle control) in limiting dilu- 
tions for 24 h. Cultures were washed to remove drugs and CCR5"2", 
CD8-depleted, phytohaemagglutinin-stimulated PBMCs from an unin- 
fected donor were added twice to amplify virus outgrowth. Culture 
supernatants were assayed for HIV p24 expression by ELISA on day 15 
and confirmed on day 19. A maximum-likelihood method was used to 
estimate the frequency of inducible virus and is reported as infectious 
units per million**. 


RNA-sequencing analysis of human cells 

Total CD4*T cells were isolated from PBMCs of four ART-treated avirae- 
mic patients by negative selection (EasySep Human CD4’ T Cell Enrich- 
ment Kit, StemCell) according to the manufacturer’s instructions. Dead 
cells and other debris were removed using a Dead Cell Removal Kit 
(Miltenyi Biotec) according to the manufacturer’s instructions. Cells 
from each patient were treated with 0.05% DMSO, 100 nM AZD5582 
or 25 nM ingenolB and collected 2 h, 6 hand 24 h after exposure. RNA 
was isolated from the collected cells using AllPrep DNA/RNA Mini Kit 
(Qiagen). Then, 200 ng of RNA from each sample was checked for qual- 
ity using an Agilent Bioanalyzer; RNA-integrity number scores were 
typically >9.0, suggesting that high-quality RNA was obtained. These 
total RNA samples were then processed into stranded, mRNA libraries 
using the KAPA library preparation kit (KAPA BioSystems, F. Hoffmann- 
La Roche). The concentrations of the final libraries were checked by 
Qubit (Thermo Fisher) and the fragment size distribution (mean size, 
359 bp) was analysed with a BioAnalyzer HS-DNA chip (Agilent). Samples 
were then sequenced using an Illumina Hiseq 4000 sequencer using 
a paired-end 50-bp by 50-bp run. Samples were successfully demul- 
tiplexed and then quality assurance and quality control was carried 
out using FASTQC (v.0.11.1) (https://www.bioinformatics.babraham. 
ac.uk/projects/fastqc/). 

Raw reads were mapped to the human genome and transcriptome 
(GRCh38.p7) using STAR® and Salmon (v.0.7.2)*°. Data were normal- 
ized and analysed for changes in gene expression using the DESeq2 
package” in R. Pvalues were adjusted for multiple testing using a false- 
discovery rate using the Benjamini- Hochberg method”. Data were 
analysed both jointly and within each treatment compared with the 
vehicle control. Differential expression of outliers was assessed and 
found to have a non-significant overall effect. Thresholds applied to 
call a significant response were mean log,-transformed fold change 
>1 and adjusted P< 0.05. Graphs and summary tables were built inR 
using ggplot. Gene-set enrichment was performed using GSEA (v.2.2.3) 


and GO analysis (GO PANTHER v.11.1)*”. Results shown are the mean 
responses of the four donors tested. 


Generation and maintenance of BLT mice 

BLT mice were prepared as previously reported’**°*". In brief, a1-2-mm 
piece of human liver tissue was sandwiched between two pieces of 
autologous thymus tissue (Advanced Bioscience Resources) under 
the kidney capsule of sublethally irradiated (200 cGy) 12-15-week- 
old female NOD.Cg-Prkdc“4 Il2rg°™!"“/Sz) (NSG; The Jackson Labora- 
tory) mice. After implantation, mice were transplanted intravenously 
with CD34* haematopoietic stem and progenitor cells isolated from 
autologous human liver tissue. Human immune cell reconstitution 
was monitored in the peripheral blood of BLT mice by flow cytometry 
every 3-4 weeks”. For the study that examined the effect of AZD5582 
administration on T cell activation, the mean weight of the mice used 
was 26.88 g and they were approximately 1 year of age at the initia- 
tion of the study. For the study that examined the effect of AZD5582 
administration on plasma and tissue viraemia during ART suppres- 
sion, the mean weight of the mice used was 23.14 g and the mice were 
approximately 7 months old at the initiation of the study. Mice were 
randomized for assignment to either experimental or control groups 
(https://www.random.org/). 


HIV infection of BLT mice 

Stocks of HIV-Lp.csp were prepared as follows. The proviral clone was 
transfected into HEK293T cells using Lipofectamine 2000 (Invitro- 
gen) following the manufacturer’s protocols. Viral supernatant was 
collected 48 h after transfection. Viral supernatant was titrated by 
infecting TZM-bl cells (NIH AIDS Reagent Program, authenticated by 
morphological identification and virus-susceptibility profiles, tested 
for mycoplasma by the supplier) at multiple dilutions. Virus-containing 
medium was removed the next day and replaced with fresh Dulbecco’s 
modified Eagle medium (DMEM; ThermoFisher) plus 10% FBS and the 
incubation continued for 24 h. The cells were fixed and stained with 
5-bromo-4-chloro-3-indolyl-8-D-galactopyranoside and blue cells were 
counted directly to determine infectious particles per ml. Each titre of 
these viral stocks was performed in triplicate and at least two different 
titre determinations were performed for each virus stock. Exposure of 
BLT mice to HIV-Lp.csp Was conducted by tail vein injection with 3 x 10* 
tissue culture infectious units of virus. Plasma viral load in peripheral 
blood of infected mice was monitored longitudinally by RT-qPCR using 
TaqMan RNA to-C, 1-step kit (Applied Biosystems). The sequences 
of the forward and reverse primers and the TaqMan probe for PCR 
amplification and detection of HIV gag RNA were: 5’-CATGTTTTCA 
GCATTATCAGAAGGA-3’, 5’-TGCTTGATGTCCCCCCACT-3’ and 5’-FA 
M-CCACCCCACAAGATTTAAACACCAT-GCTAA-Q-3’, respectively. For 
viral load analysis, 40 ul of plasma was collected and analysed witha 
sensitivity of 350 copies per ml. All samples were run and analysed on 
an ABI 7500 Fast Real Time PCR System (Applied Biosystems). 


SIV infection of NHPs 

In total, 21 male and female Indian rhesus macaques, 3-6 years of age, 
with the exclusion of MamuB*08* and MamuB*17' macaques, were 
included in this study (Supplementary Table 7). Rhesus macaques 
were infected intravenously with 3 x 10? TCIDs, (50% tissue culture 
infectious dose) Of SIVinaco39- Phe SIVmaco39 Stock was titrated in vitro 
for viral infectivity by standard end-point titration on CEMx174 cells. 
The TCID,, was calculated as previously described. Standard SlVjnaco39 
plasma viral-load quantification was performed regularly throughout 
the study and three times per week during the AZD5582 treatment 
period in the Translational Virology Core Laboratory of the Emory 
Center for AIDS Research using a standard qPCR assay (limit of detec- 
tion of 60 copies per ml of plasma) as previously described*. Ultra- 
sensitive SIV,,ac239 plasma viral-load quantification (limit of detection 
of 3 copies per ml of plasma) was performed for 2-3 time points before 
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AZD5582 treatment and 3-4 time points during AZD5582 treatment as 
previously described***. 


ART and AZD5582 treatment of BLT mice and NHPs 

ART was administered to BLT mice using irradiated Teklad chow con- 
taining emtricitabine (1,500 mg kg“), tenofovir disoproxil fumarate 
(1,560 mg kg“) and raltegravir (600 mg kg) (Research Diets). Both 
the AZD5582 and the vehicle control (10% sterile captisol dissolved in 
sterile distilled water) were prepared fresh for each administration. 
AZD5582-2HCI was obtained from ChemieTek and dissolved at5 mg mI 
in sterile distilled water (Gibco, Life Technologies) containing 10% 
captisol B-cyclodextrine sulfobutyl ethers sodium salts (Cydex Phar- 
maceuticals). AZD5582 was administered by intraperitoneal injection 
atadose of 3mgkg™. 

Rhesus macaques were treated with a potent 3-drug ART regimen 
initiated 56 days after infection that consisted of 2 reverse transcriptase 
inhibitors, tenofovir disoproxil fumarate (5.1 mg mI) and emtricitabine 
(40 mg mI) plus the integrase inhibitor dolutegravir (2.5 mg mI“). ART 
was administered once daily at 1 ml kg body weight via the subcuta- 
neous route. Peak plasma viral load (measured by the standard assay) 
and plasma viral load before LRA intervention (as measured by the 
ultrasensitive assay) were controlled for when allocating macaques 
into experimental groups. In total, 12 rhesus macaques were treated 
with AZD5582 and 9 rhesus macaques served as controls. AZD5582 was 
infused weekly intravenously at 0.1 mg kg™. Three rhesus macaques 
received 3 doses of AZD5582 and were euthanized 48 h after the last 
dose. Nine rhesus macaques received 10 doses of AZD5582 and 3 
macaques were euthanized 48 h after the last dose. Among the con- 
trol rhesus macaques, 4 macaques received a weekly placebo infusion, 
2 macaques were euthanized 48 h after 3 infusions and 2 macaques 
were euthanized 48 h after 10 infusions. The remaining five control 
macaques received ART only. 


Resting CD4* T cell enrichment 

Human resting CD4* T cells were enriched from total cells isolated 
from BLT mouse tissues as follows. Each tissue from each mouse was 
processed individually and then tissues were pooled for immunomag- 
netic sorting. Each pooled tissue was first enriched for human cells 
with the EasySep Mouse/Human Chimera Kit (Stem Cell Technolo- 
gies) and then for resting CD4° T cells with a human custom selection 
kit that included the following antibodies: CD8, CD14, CD16, CD19, 
CD20, CD36, CD56, CD123, glycophorin A, CD66b, CD25 and HLA-DR 
(all Stem Cell Technologies). Flow cytometry was performed before 
and after the enrichment to confirm the efficacy of the sort and purity 
of the sorted samples. 

Resting CD4* T cells from rhesus macaques were isolated from the 
peripheral blood, bone marrow, lymph node and spleen. Before sorting, 
CD4" T cells were enriched using magnetic beads and column purifi- 
cation (Miltenyi Biotec). Enriched CD4* T cells were then stained with 
previously determined volumes of the following fluorescently conju- 
gated antibodies: CD3 AF700 (clone SP34-2), CD8 APC-Cy7 (clone SK1), 
CD69 PE-CF594 (clone FN50), HLA-DR PerCP-Cy5.5 (clone G46-6) (all 
from BD Bioscience) and CD4 BrilliantViolet (BV)650 (clone OKT4) 
and CD25 PE-Cy7 (clone BC96) (both from Biolegend). Resting CD4* 
T cells were defined as CD3*CD4*CD8 CD69°CD25 HLA-DR . Sorting 
was performed ona FACSAria LSR II (BD Biosciences) equipped with 
FACSDiva software. 


Plasma cytokine and chemokine analysis 

Human plasma cytokine and chemokine analysis in BLT mice was per- 
formed by the University of North Carolina, Chapel Hill Center for AIDS 
Research Virology Core Laboratory. Plasma was tested undiluted in sin- 
gle wells using a Milliplex MAP kit (Millipore, HCYTMAG-60K-PX41) ona 
Luminex MAGPIX instrument. The following markers were tested: EGF, 
eotaxin, FGF-2, FLT-3L, fractalkine, G-CSF, GM-CSF, GRO, IFNa2, IFNy, 


IL-1a, IL-1B, IL-1RA, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-12p40, 
IL-12p70, IL-13, IL-15, IL-17A, IP-10, MCP-1, MCP-3, MDC, MIP-1a, MIP-1B, 
PDGF-AA, PDGF-AB/BB, RANTES, sCD40L, TGFa, TNF, TNFB and VEGF. 

Rhesus macaque plasma levels of proinflammatory cytokines and 
chemokines were evaluated using the NHP MSD V-Plex assay systems 
developed by MesoScale Discovery. The two validated kits used were 
the V-PLEX Plus Proinflammatory Panel 1 NHP Kit (K15056D) that evalu- 
ates IFNy, IL-10, IL-1f, IL-2, IL-6 and IL-8, and a custom Chemokine NHP 
Kit that evaluates IP-10, MCP-1 and MIP-18 (K15055G). The manufac- 
turer-provided protocol was followed with a few modifications. As the 
plasma samples were infectious, Triton X-100 at a final concentration 
of 1% was added not only to the subject samples but also to the assay 
diluents provided with the MSD kit before use. Calibration standards 
were reconstituted according to the protocol provided with the assay 
diluent that had the Triton X-100 added to it so that all standards and 
samples had the same components present. Plasma (60 pl) was diluted 
twofold with assay diluent and then 50 ul of the diluted sample was 
added to the assay plate along with 50 pl of the calibrator according 
the manufacturer’s protocol. The plate was then covered and incubated 
onashaker for 2h. After the 2-h incubation, plates were washed at least 
6x with the supplied wash buffer and then 25 I of detection antibodies 
was added according to the kit protocol. Subsequently, the plate was 
covered and incubated ona shaker for 2h at room temperature. After 
the detection incubation was finished, plates were then washed 6x and 
150 pl of 2x read buffer was added to each well and plates were analysed 
on the Sector s600 MSD plate reader. Data analysis was performed 
using the MSD Discovery Workbench analysis software. 


Cell-associated HIV RNA quantification in BLT mice 

For tissue RNA analysis, RNA was extracted using QlAamp viral RNA 
columns (Qiagen) according to the manufacturer’s protocol includ- 
ing an optional treatment with RNase-free DNase and analysed using 
one-step reverse-transcriptase qPCR (ABI custom TaqMan Assays- 
by-Design)**. Known quantities of HIV gag RNA standards were run 
in parallel, creating a standard curve for HIV gag and sample RNA was 
quantified by extrapolation from the standard curve. All samples were 
run and analysed on an ABI 7500 Fast Real-Time PCR System (Applied 
Biosystems). Owing to the relatively low number of human cells found 
inthe brain, HIVRNA levels were quantified for only three vehicle con- 
trol- and three AZD5582-treated mice. 


Immunohistochemical analysis in BLT mice 

Tissues for immunohistochemical analysis were collected from BLT 
mice and fixed in 10% formalin for 16-24 h at 4 °C. Samples were then 
embedded in paraffin, cut into 5-um sections and mounted onto poly- 
L-lysine-coated glass slides. After paraffin removal, antigen retrieval 
(DIVA Decloaker, Biocare Medical) and blocking of nonspecific immu- 
noglobulin-binding sites (Background Sniper, Biocare Medical), tissue 
sections were stained with anti-clAP1 antibody (R&D Systems) overnight 
at 4 °C. To detect clIAP1, sections were probed with a goat-on-rodent 
HRP polymer (Biocare Medical) and developed with diaminobenzi- 
dine (ImmPact DAB Peroxidase Substrate, Vector Laboratories). As 
an isotype control, tissue sections were stained with polyclonal goat 
IgG (R&D Systems) negative control antibodies. Tissue sections were 
imaged with a Nikon Eclipse Ci microscope using Nikon Elements BR 
software (v.4.30.01) and a Nikon Digital Sight DS-Fi2 camera. 


Serum chemistry analysis in mice 

After treatment with AZD5582 for 24 h, serum chemistry analysis 
in female 20-week-old BALB/c] mice (The Jackson Laboratory) was 
performed by the University of North Carolina, Chapel Hill Animal 
Histopathology Core Laboratory of the Lineberger Comprehensive 
Cancer Center. All clinical chemistry was performed on an Alfa Wasser- 
mann Vet Axcel analyser using Alfa Wassermann reagents. Automated 
assays for each analyte were as follows and performed according to the 


manufacturer’s instructions: alkaline phosphatase, Tietz-optimized 
Bowers and McCombassay; aspartate aminotransferase, Henry modifi- 
cation of Karmen’s assay; alanine aminotransferase, Henry modification 
of Wroblewski and LaDue assay; creatinine, Jaffe reaction; blood urea 
nitrogen, enzymatic assay; albumin, Doumas and Briggs modification 
of the bromcresol green dye method; amylase, based on the use of 
chromogenic 2-chloro-p-nitrophenol linked with maltotriose; calcium, 
calcium-arsenazo assay phosphorus, based on the method of Daly 
and Ertingshausen with modifications by Armador and Urban; total 
bilirubin, Walters and Gerarde modification of the DMSO method; total 
protein, modification of Weichselbaum’s biuret reagent. 


Immunophenotyping of BLT mice by flow cytometry 
Immunophenotyping was performed on peripheral-blood samples 
collected longitudinally and at study end point, on blood and mono- 
nuclear cells isolated from the tissues of BLT mice. All flow cytometry 
data were collected on either BD LSR Fortessa or BD FACSCanto instru- 
ments using BD FACSDiva software (v.6.1.3) and data were analysed with 
FlowJo software (v.10.4.2). Antibodies used for longitudinal monitoring 
of human cells in peripheral blood include anti-CD45 APC (clone HI30, 
BD Biosciences), anti-CD3 FITC (clone HIT3a, BD Biosciences), anti-CD4 
PE (clone RPA-T4, BD Biosciences) and anti-CD8 PerCP (clone SK1, BD 
Biosciences). Flow cytometry gating for the expression of lineage- 
specific antigens on human leukocytes was performed as follows. Step 
1, forward- and side-scatter properties were used to set a live-cell gate. 
Step 2, live cells were then analysed for expression of the human pan- 
leukocyte marker CD45. Step 3, human leukocytes were then analysed 
for human CD3* T cells. Step 4, T cells were analysed for human CD4 
and CD8 expression (Supplementary Fig. 2a). 

Atcollection, peripheral blood and cells isolated from each individual 
tissue were stained with antibodies to detect human CD45, CD3, CD4, 
CD8, CD38 (anti-CD38 APC, clone HB7, BD Biosciences) and HLA-DR 
(anti-HLA-DR PE, clone TU36, BD Biosciences) to assess T cell activation. 
The following gating strategy was used. Step 1, forward- and side-scatter 
properties were used to set a live cell gate. Step 2, live cells were then 
analysed for expression of the human pan-leukocyte marker CD45. 
Step 3, human leukocytes were subsequently analysed for human CD3* 
Tcells. Step 4, T cells were analysed for human CD4 and CD8 expression. 
Step 5, either CD4* or CD8°T cells were analysed for the expression of 
CD38 and HLA-DR (Supplementary Fig. 2b). 

To determine the success of the sort and the purity of the sorted 
human resting CD4' T cells, the following antibodies were used to ana- 
lyse pre-sort and post-sort samples: anti-CD3 BV421 (clone UCHT1, BD 
Biosciences), anti- HLA-DR PerCP (clone L243, BD Biosciences), anti-CD4 
BV605 (clone RPA-T4, BD Biosciences), anti-CD8 APC-Cy7 (clone SK1, 
BD Biosciences), anti-CD25 APC (clone 2A3, BD Biosciences) and anti- 
CD45 V500 (clone H130, BD Biosciences). Antibodies used as isotype 
controls: anti-mouse IgG1k APC (clone MOPC-21, BD Biosciences), anti- 
mouse IgG2ax PerCP (clone X39, BD Biosciences), anti-mouse IgG1k 
PE (clone MOPC-21, BD Biosciences), anti-mouse IgG1k PE-Cy7 (clone 
MOPC-21, BD Biosciences) and anti-mouse IgG2ax FITC (clone G155-178, 
BD Biosciences). Flow cytometry gating was performed as follows. Step 
1, live cells were gated based on forward scatter and side scatter. Step 2, 
human haematopoietic cells were gated based on expression of human 
CD45. Step 3, human T cells were gated based on expression of CD3. 
Step 4, T cell subsets were gated based on the expression of human CD8 
and CD4. Step 5, expression of activation markers (CD25 and HLA-DR) 
were analysed on the surface of CD4* T cells (Supplementary Fig. 2c). 


NHP sample collection and processing 

EDTA-anticoagulated blood samples were collected regularly and 
used for a complete blood count, routine chemical analysis and 
immunostaining. Plasma was separated by centrifugation within 1h 
of phlebotomy. At the end of the studies, tissue samples were col- 
lected, including lymph nodes (21 rhesus macaques), spleen (10 rhesus 


macaques) and bone marrow (11 rhesus macaques). After 2 washes in 
RPMI and removal of connective and fat tissues, lymph nodes were 
ground using a 70-um cell strainer. PBMCs and bone-marrow mono- 
nuclear cells were prepared by density-gradient centrifugation. CD4* 
T cells were negatively selected from fresh or frozen cell suspensions 
using magnetically labelled microbeads and subsequent column puri- 
fication according to the manufacturer’s protocol (Miltenyi Biotec). 


AZD5582 pharmacokinetics in NHPs 

Three healthy male rhesus macaques of Indian origin (aged 6-7 years, 
10.5-13.0 kg) were used for the study. Fasted rhesus macaque received 
0.1 mg kg of AZD5582 in 10% captisol and <5% DMSO (dose volume 
0.25 ml kg tat 0.40 mg ml’), filtered during administration (0.22-um 
PES in-line filter, Millex), via a 30-min saphenous vein infusion (infu- 
sion pump, Harvard Apparatus). Blood from femoral venipuncture 
or saphenous catheters was collected to obtain samples for plasma 
pharmacokinetics (K2 EDTA microtainers spun at 13,000 rpm for 5 min 
to obtain plasma), flow cytometry (Cytochex tubes (Streck)) and PBMC 
isolation (CPT blood collection tubes, BD Biosciences) processed 
according to the manufacturers’ instructions and a dry pellet of cells 
was snap-frozen on dry ice and stored at —80 °C at various times after 
administration. AZD5582 was extracted from macaque plasma samples 
with an isotopically labelled internal standard (rilpivirine-d6) using 
protein precipitation. The compound was then eluted from a Waters 
Atlantis T3 (SO mmx 2.1mm, 3 pm particle size) analytical column under 
reverse-phase conditions and detected onan AB Sciex API-5000 triple 
quadrupole mass spectrometer under Turbolonspray mode. Standards 
were prepared in singlet and quality controls in duplicate, and a calibra- 
tion curve was generated using a weighted 1/(x’) linear regression of 
the concentration (x) versus the analyte:internal standard peak area 
ratio (y). Concentrations of quality controls and study samples were 
calculated from this calibration curve using Sciex Analyst Software 
(v.1.6.2). The acceptance criterion of the assay was +25% of the nominal 
concentration for standards and quality controls, andthe quantifiable 
range was 2-2,000 ng mI". 


Ex vivo analysis of AZD5582 activity in NHP cells 

Splenocytes from rhesus macaques were initially processed as above 
and frozen in FBS and 10% DMSO. Cryopreserved splenocytes were 
thawed and exposed to a range of concentrations of AZD5582 for either 
48 hcontinuously or for 1h after which the drug was removed by wash- 
ing and cells were further cultured for 47 h. In preparation for western 
blot analysis, cells were lysed on ice with intermittent vortex mixing 
using NP40 Cell Lysis Buffer (Invitrogen, FNNOO21) supplemented 
with appropriately diluted 10x Protease Inhibitor Cocktail (Sigma, 
P-2714), 1mM PMSF (Sigma, 93482) and 1 mM DTT (Sigma, 43816) to 
create a final concentration of 1mM each inthe complete lysis buffer. 
The lysed cells were then centrifuged at 13,000 rpm for 10 min at 4 °C 
and then supernatants (soluble fraction lysates) were removed and 
stored at -80 °C or used immediately for protein concentration and 
western blot analysis. 


Flow cytometry assay for p100 protein 

Whole blood collected in Cyto-Chex blood collection tubes (Myriad 
RBM) was added to ACK lysis buffer to remove red-blood cells, washed 
with PBS and resuspended ina staining cocktail that included anti-CD3 
BV421 (clone SP34-2, BD Biosciences), anti-CD16 BV605 (clone SG8, BD 
Biosciences), anti-CD4 BV711 (clone L200, BD Biosciences), anti-CD14 
BV786 (clone M5E2, BD Biosciences), anti-CD123 PerCP-Cy5.5 (clone 
7G3, BD Biosciences), anti-CD20 PE-CF594 (clone 2H7, BD Biosciences), 
anti-CD8 PE-Cy7 (clone SK1, BD Biosciences), anti-CD11c Alexa700 
(clone 3.9, eBioscience) and anti-HLA-DR APC-Cy7 (clone L243, BD 
Biosciences). After surface-staining, cells were washed twice with PBS, 
permeabilized using the Cytofix/Cytoperm kit (BD Biosciences, kit 
used as directed) and stained intracellularly with anti-p100 antibody 
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(clone EPR18756; Abcam). Cells were washed twice and stained witha 
secondary chicken anti-rabbit Alexa Fluor 488 antibody (Invitrogen). 
Samples were acquired using a four-laser Fortessa flow cytometer (BD 
Biosciences) and analysed with FlowJo software (version 9.7.6, TreeStar). 


SIV env sequencing and analysis 

To generate env cDNA, reverse transcription of viral RNA was per- 
formed using SuperScript Ill reverse transcriptase according to the 
manufacturer’s directions (Invitrogen) and the gene-specific primer 
SIVEnvR15’-TGTAATAAATCCCT TCCAGTCCCCCC-3’). Single-genome 
amplification of SIV env was performed by serially diluting this cDNA 
in independent PCR reactions to identify a dilution in which amplifica- 
tion occurred in <30% of the total number of reactions. PCR amplifica- 
tion was performed with 1x PCR buffer, 2 mM MgSO,, 0.2 mM of each 
deoxynucleoside triphosphate, 0.2 1M of each primer and 0.025 U pI 
Platinum Taq High Fidelity polymerase (Invitrogen) in a 20-1 reac- 
tion. First-round PCR was performed with primer SIVEnvF1 5’-CCTC- 
CCCCTCCAGGACTAGC-3’ and antisense primer SIVEnvR1 under the 
following conditions: 1 cycle of 94 °C for 2 min, 35 cycles at 94 °C for 
15s, 55°C for 30 s and 68 °C for 5 min, followed by a final extension 
of 68 °C for 10 min. Next, 1 pl from the first-round PCR product was 
added toa second-round PCR reaction that included the sense primer 
SIVEnvF2 5’-TATAATAGACATGGAGACACCCT TGAGGGAGC-3’ and anti- 
sense primer SIVEnvR2 5’-ATGAGACATRTCTAT TGCCAATTTGTA-3’ 
performed under the same conditions used for first-round PCR, but 
with a total of 45 cycles. Correctly sized amplicons were identified 
by agarose gel electrophoresis and directly sequenced with second- 
round PCR primers and nine SIV-specific primers using BigDye Termi- 
nator technology (Applied Biosystems). To confirm PCR amplification 
froma single template, chromatograms were manually examined for 
multiple peaks, indicative of the presence of amplicons that resulted 
from PCR-generated recombination events, Taq polymerase errors or 
multiple variant templates. Alignment and phylogenetic trees were 
implemented in Geneious Prime 2019 (Biomatters) using the Muscle 
algorithm and the neighbour-joining method with the Tamura-Nei 
genetic distance model, respectively. 


Cell-associated SIV RNA and DNA quantification in NHPs 
Cell-associated SIV RNA and DNA were measured simultaneously in rest- 
ing CD4* T cells isolated from peripheral blood, spleen, bone marrow 
and lymph nodes (52,279-500,000 cells) or in total CD4* T cells iso- 
lated from peripheral blood or lymph nodes (100,000-500,000 cells), 
lysed in RLT+ Buffer (Qiagen), and stored at —80 °C. Nucleic acids were 
extracted using the AllPrep DNA/RNA mini kit (Qiagen) according to 
the manufacturer’s recommendations with an on-column DNase diges- 
tion step. Cell-associated DNA quantification of SIV,,4.739 gag DNA was 
performed on the extracted cell-associated DNA by qPCR using a 5’ 
nuclease (TaqMan) assay with SIV gag primers and normalized to the 
rhesus macaque albumin gene, as previously described“. For cell- 
associated RNA quantification, RNA was reverse-transcribed using 
the High Capacity cDNA Reverse Transcription kit (ThermoScientific) 
and random hexamers. SIV gag and the rhesus macaque CD4 gene were 
quantified by qPCR of the resultant cDNA using Taqman Universal 
Mastermix II (ThermoScientific). The CD4 primer and probe sequences 
were Rh-CD4-F 5’-ACATCGTGGTGCTAGCTTTCCAGA-3’, Rh-CD4-R 
5’-AAGTGTAAAGGCGAGTGGGAAGGA-3’ and Rh-CD4-probe 5’-AGGC- 
CTCCAGCACAGTCTATAAGAAAGAGG-3’. The means of two replicate 
wells were used in all analyses. Samples with undetectable SIV DNA or 
RNA were assigned a level of 1 copy per million cells or million CD4 RNA 
copies, respectively, for display purposes. 


SIV quantitative viral outgrowth assay 

Replication-competent SIV reservoirs were measured by the Viral Res- 
ervoir Core Laboratory of the Emory Center for AIDS Research. Latently 
infected cells were quantified using a limiting dilution culture assay in 


which CD4* T cells—enriched from lymph node or spleen cells using 
magnetic beads and column purification (Miltenyi Biotec)—were co- 
cultured with CEMx174 cells in fivefold serial dilutions ranging from 
5x 10° cells per well to 4 x 10° cells per well. The cells were cultured 
in RPMI containing 10% FBS and 100 U mI IL-2 (Sigma). The ratio of 
target cells added was 4:1 for the two highest dilutions. A constant num- 
ber of 1 x 10° CEMx174 cells was added to all other wells. The cultures 
were split every 7 days, and fresh medium was added. After 21 days, 
the growth of virus was detected by RT-qPCR. SIV RNA was isolated 
from 400 ul of culture supernatant using the Zymo viral RNA isolation 
kit (Zymo Research). DNase treatment was performed using a RQ1 
RNase-free DNase kit (Promega). A one-step RT-qPCR targeting SIV 
gag was performed using an Applied Biosystems 7500 Real Time PCR 
System (Applied Biosystems) and the Taqman Fast Virus 1-step Master 
Mix (Thermo Scientific) for qRT-PCR with the following primers and 
probe: SIVgagFwd 5’-GCAGAGGAGGAAAT TACCCAGTAC-3’, SIVgagRev 
5’-CAATTTTACCCAGGCATTTAATGTT-3’ and SIVgag probe 5’-6FAM- 
TGTCCACCTGCCAT TAAGCCCGA-3IBFQ-3’. The frequencies of infected 
cells were determined by the maximum-likelihood method* and were 
expressed as infectious units per million CD4’ T cells. 


RNA-sequencing analysis of NHP cells 

RNA-sequencing (RNA-seq) analysis was conducted at the Yerkes Non- 
human Primate Genomics Core Laboratory (http://www.yerkes.emory. 
edu/nhp_ genomics core/). RNA was purified from 50,000 peripheral- 
blood- or lymph-node-derived CD4’T cells, which were purified by flow 
cytometry and lysed in 350 pl of RLT buffer at —-80 °C, using Qiagen 
Micro RNEasy columns, and RNA quality was assessed using an Agilent 
Bioanalyzer. Then, 2 ng of total RNA was used as input for mRNA ampli- 
fication using 5’ template switch PCR with the Clontech SMART-seq v4 
Ultra Low Input RNA kit according to the manufacturer’s instructions. 
Amplified mRNA was fragmented and appended with dual-indexed bar 
codes using Illumina NexteraXT DNA library preparation kits. Libraries 
were validated by capillary electrophoresis on an Agilent 4200 TapeSta- 
tion, pooled and sequenced on an Illumina HiSeq 3000 using 100-bp 
single reads at an average depth of 25 million reads. 


RNA-seq statistical analyses 

RNA-seq data were mapped to the MacaM (v.7.8) assembly of the Indian 
rhesus macaque genome” (available at https://www.unmc.edu/rhe- 
susgenechip/index.htm) and alignment was performed with STAR 
(v.2.5.2) using the annotation as a splice junction reference. Transcripts 
were annotated using the MacaM (v.7.8.2) annotation. Transcript abun- 
dance was estimated within STAR using the htseq-count algorithm 
and differential-expression analyses were performed using DESeq2. 
For significance testing using DESeq2, all of the samples from three- 
and ten-dose rhesus macaques were grouped together, separately 
for peripheral blood and lymph nodes, and compared with their pre- 
treatment samples. Genes determined to be differentially expressed by 
DESeq2 (adjusted P< 0.05 and fold change > +1.5) were tested for enrich- 
ment of molecular pathways using the DAVID database™ (https://david. 
ncifcrf.gov/) and by using GSEA with the desktop module (https://www. 
broadinstitute.org/gsea/). Gene sets for GSEA analysis were selected 
from the MSigDB database (http://software.broadinstitute.org/gsea/ 
msigdb/index.jsp) and from the previously described ‘blood transcrip- 
tome modules". Heat maps, DAVID enrichment bar charts and GSEA 
enrichment plots were generated with the R (v.3.5.0) package ggplot2. 
Principal component analysis was performed using Partek Genomics 
Suite software (v.6.6) using a covariance matrix. 


Immunophenotyping of NHPs by flow cytometry 

Multicolour flow cytometry analysis was performed on whole blood 
and lymph-node mononuclear cells using predetermined optimal con- 
centrations of the following fluorescently conjugated monoclonal anti- 
bodies: CD3 APC-Cy7 (clone SP34-2), Ki-67 AF700 (clone B56), HLA-DR 


PerCP-Cy5.5 (clone G46-6), CCR5 APC (clone 3A9), CCR7 FITC (clone 
A20), CD45RA PE-Cy7 (clone 5H9) and CD62L PE (clone SK11) (all from 
BD Biosciences); CD8 BV711 (clone RPA-T8), CD4 BV650 (clone OKT4), 
CD95 BV605 (clone DX2) and PD-1 BV421 (clone EH12.2H7) (all from 
Biolegend) and CD28 PE-Cy5.5 (clone CD28.2) (from Beckman Coulter). 
Flow cytometry acquisition and analysis of samples was performed 
on at least 100,000 events on an LSRII flow cytometer driven by the 
FACSDiva software package (BD Biosciences). Analyses of the acquired 
data were performed using FlowJo software (Tree Star, v.10.0.4). 


ELISPOT for SIV-specific IFNy production in NHPs 

IFNy production was evaluated after the stimulation of PBMCs with 
SIVinacz30 Gag and SIV,,40239 Env peptide pools. The following reagents 
were obtained through the NIH AIDS Reagent Program, Division of 
AIDS, NIAID, NIH: SIV,,4¢239 Env Peptide Pool and the SIV,,3.239 Gag Pep- 
tide Pool. To analyse IFNy expression, we used the Monkey Elispot kit 
from MABTECH. The manufacturer’s instructions were followed as 
provided with the exception that concanavalin A (final concentration 
of 2.5 pg ml) was used as the positive control agent. ELISPOT plates 
were blocked for 30 min with 100 pl of RPEMI1640 supplemented with 
10% FBS. PBMCs (4 x 10° per well) were incubated with 1 pg ml of DMSO, 
SlVinaco39 Gag Peptide Pool or SIV,,3-239 Env Peptide Pool for 18 h before 
running the assay. For each plate, concavalin A was used for the positive 
control wells. Samples were run in duplicate. 


Intracellular cytokine staining 

PBMCs from five ART-suppressed SIV-infected rhesus macaques were 
thawed. Pre-treatment with AZD5582 was performed for 1h at100 nm. 
After two washes, phorbol 12-myristate 13-acetate (PMA) andionomycin 
were added for 1h at 500 ng ml‘ and 10 pg mI”, respectively. DMSO 
treatment controls were prepared in parallel. After 1h, brefeldin-A 
and Golgi stop solution were added following the manufacturer’s rec- 
ommendations (BD GolgiStop Protein Transport Inhibitor, BD Bio- 
sciences). Cells were incubated at 37 °C, 5% CO, inR10 for 6-8 h before 
staining with the following antibodies: CD3 APC-Cy7 (clone SP34-2), 
CD8 BV711 (clone RPA-T8), CD4 BV650 (clone OKT4) (all from Bioleg- 
end), and IFNy PE (clone B7), TNF AF700 (clone Mab11) and IL-2 BV605 
(clone MQ1-17H12) (all from BD Biosciences). Flow cytometry acquisi- 
tion and analysis of samples was performed on at least 100,000 events 
using an LSRII flow cytometer driven by the FACSDiva software package 
(BD Biosciences). Analyses of the acquired data were performed using 
FlowJo (TreeStar, v.10.0.4) and simplified presentation of incredibly 
complex evaluations (SPICE, v.6.0)” software. 


Proliferation assay 

PBMCs from five ART-suppressed SIV-infected rhesus macaques were 
thawed and labelled with CellTrace Violet Proliferation Kit accord- 
ing to the manufacturer instructions (Molecular Probes). Cells were 
plated in R10 in 96-well round-bottom plates at 4 million cells per ml. 
Pre-treatment with AZD5582 was performed for 1h at 100 nM. After 
two washes, PMA and ionomycin were added for 1h at 500 ng ml and 
10 pg ml, respectively. DMSO treatment controls were prepared in 
parallel. Cells were incubated at 37 °C, 5% CO, in R10 supplemented 
with 20 IU mI IL-2. After 5 days, cells were stained with CD3 APC-Cy7 
(clone SP34-2), CD8 BV711 (clone RPA-T8), CD4 BV650 (clone OKT4) 
(all from Biolegend) and analysed by flow cytometry on anLSRllinstru- 
ment. Data were analysed using FlowJo software (TreeStar, v.10.0.4). 


Statistical analysis 

Statistical analyses were performed using GraphPad Prism Software (v.6 
orv.7).P< 0.05 was considered statistically significant. Atleast three sam- 
ples were used for each group, the minimum to achieve statistical signif- 
icance. No statistical methods were used to predetermine sample size. 
Investigators were not blinded to group allocations or when assessing 
outcomes. In some instances, cells were pooled from individual 


humanized mice for each tissue and experimental group for the iso- 
lation of resting CD4" T cells (Fig. 2d and Extended Data Fig. 2). To test 
the statistical significance of the differences that we observed in PCR 
data in BLT mouse brains in Fig. 2e and grouped tissue PCR data in 
Fig. 2d, unpaired two-sided Student’s t-tests were used. To test the 
statistical significance of the differences that we observed in PCR data 
in PBMCs and female reproductive tract in Fig. 2e, T cell activation mark- 
ersin Supplementary Table 5 and plasma cytokines and chemokines in 
Supplementary Table 6, we used unpaired two-sided Mann-Whitney 
U-tests. To assess the statistical significance of the differences observed 
over time inserum enzymes in Supplementary Table 4, we used paired 
nonparametric Wilcoxon matched-pairs signed-rank tests. To test the 
statistical significance of the differences in activation marker levels 
expressed on T cells from rhesus macaques in Fig. 4d, e, as well as the 
ultrasensitive plasma viral load results in Extended Data Fig. 8b and the 
ex vivo proliferation assay in Extended Data Fig. 10g, we used Wilcoxon 
matched-pairs signed-rank tests. To assess the statistical significance 
of the differences observed between SIV DNA and SIV RNA levels in rest- 
ing or total CD4*T cells from rhesus macaques in Fig. 3e and Extended 
Data Fig. 6b, respectively, as well as for the quantitative viral outgrowth 
results in Extended Data Fig. 6c, we used unpaired two-sided Mann- 
Whitney U-tests. Unpaired two-sided Mann-Whitney U-tests were also 
used to compare rhesus macaques with stable compared with increased 
viraemia in Extended Data Fig. 8c. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


Source Data for Figs. 1-4, Extended Data Figs. 1, 2, 4, 6-10 and Sup- 
plementary Tables 4-6 are provided with the paper. Gene-expression 
data are available at the Gene Expression Omnibus (GEO) repository 
(accession number GSE141546 and GSE142774). Any other data are 
available from corresponding authors on reasonable request. 
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Extended Data Fig. 1| Short-duration exposure to AZD5582 activates the 
ncNF-kB pathway. a-d, Isolated total human CD4’ T cells treated with 1OO nM 
AZD5582 and then either not washed (a) or washed three times with PBS 8 h (b), 
4h(c) or1h(d) after treatment. Whole-cell lysates were then analysed by 
immunoblot for components of the ncNF-KB pathway. e, Densitometry analysis 
of the ratio of p52 to p100 from the pulse-wash assay immunoblots. Points 
represent values for the densitometric ratio from one western blot, 
representative of several independent experiments. a-e, Entire set 
representative of 1experiment with 5 replicates of the 1-h wash condition 
conducted. f, Fold induction of ncNF-«B target gene expression in isolated 
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CD4*T cells from an uninfected donor treated with the indicated 
concentrations of AZD5582 and either washed after 1h or not washed, and 
subsequently cultured for 24 has measured by RT-qPCR. Points represent two 
technical replicates and lines represent the mean. The data presented are 
representative of three independent experiments. g, DMSO-normalized 
induction of luciferase activity from the Jurkat reporter model after exposure 
to AZD5582 (10, 100 or 1,000 nM) for 30 min (blue), 1h (red), 2h (green) or 
continued exposure (purple). Points represent three replicates in one assay 
run, representative of two independent experiments. Lines represent the mean 
of the three replicates. For gel source data, see Supplementary Fig. 1. 
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Extended Data Fig. 2| AZD5582 induces HIV RNA expression in resting CD4* 
Tcells from tissues of HIV-infected, ART-suppressed BLT mice. HIVRNA 
levels in resting CD4*T cells isolated from the bone marrow, lymphnodes, liver, 
lung and spleen of control (Cntrl, blue circles) or AZD5582-treated (AZD, red 
circles) mice (cells pooled from n=6 mice per group for each tissue) were 
analysed in triplicate. Statistical significance was determined with a two-sided 
Student’s t-test. The mean fold increase in viral RNA levels in resting CD4* 
Tcells from tissues in the experiment shown in Fig. 2 was 12.1 (+3.7), whereas in 
this figure it was 8.4 (+6.0). These values were not statistically different 
(P=0.4286, two-sided Mann-Whitney U-test). Data are mean+s.e.m. 
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Extended Data Fig. 3 | AZD5582 ex vivo target engagement. a, Western blot 
analysis of p100, p52 and clAP1 protein levels in cells isolated from the bone 
marrow, lymph nodeand spleen of BLT mice before and 20 h after ex vivo 
treatment with AZD5582. Loading control, B-actin. Representative of two 
experiments. b, cIAP expression in resting CD4’T cells isolated fromthe 
thymus, spleen, lymph nodes, liver, lung and bone marrow. Loading control, 
B-actin. Representative of two experiments. c, cIAP expression in the thymic 
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organoid of HIV-infected ART-suppressed BLT mice 48 hafter the 
administration of vehicle control or AZD5582 (three control and one AZD5582- 
treated analysed). Positive cells, brown. Imaged at 4x and 40x magnifications; 
scale bars, 100 pm (4x) and 50 pm (40x). Boxes in the 4x images indicate 
regions corresponding to images at 40x magnification. For gel source data, see 
Supplementary Fig. 1. 


Article 


a 7 1,000 150 o b d f 3 
z A -e p100 MFI ES e =) sod uninfected controls AZD5582 nM Z 3 z 
2 400 ~@ [AZD5582] S e =3doses | ART-suppressed DMSO 100 20 4 0.8 0.16 0.032 0.0064 3S 2 rs) 
oA = 103 ~6doses | SIV-infected RM 
2 x = > 10 doses ~~. —" re rrr eee 100 
iB 10 100 x 
a S Q 
z 1 8 ja) 
g 8 y wr ae 2 
7) =} o | te ” 
6 04 50 8 £ 
_ oe. 2 8 a 8 8 00 05 10-15 0~=CO2.0 25 
Hours post-infusion a Hours post-infusion 
Pee ene 
Dose 3 LN cells Dose 10 LN cells 
AZD5582 group} RFKI6 RNp16 RLu16 RAj16 RDM6 RDm16RKn16 RDI6 RNq16 RQd16 
ope RFKIG —_—_—— = ee O00 
S] = RNpi6 e g 
Sl« riute RM splenocyte response to AZD5582 RM splenocyte response 
woe ee Text ——— p52 to AZD5582 
Syo RDm16 9.10 ulsed vs continuous 
g © RDI6 ee ee es actin = ¢ e 
als RKn16 on - ga 
2 2 f&E ° 
Qo 3 o 6 s 
oO 8 4 & © 2 
oe group Zz 5 4 z a 2 4 S 
@|® RAj16 & —— £ Qa =a 
3} m= RDF6 g . 2 ats 2 S, 
© 4 —— © z r 2 
2 
4 RNg16 S 3 DMSO 0.0 01 1 10 100 , 
© RQd16 20 2 
Q AZD5582 —_—_ Control q AZD5582 —_—_ Control AZD5562 [nM] SP 
8 . vero 


Extended Data Fig. 4 | Pharmacokinetic and pharmacodynamic assessment 
of AZD5582 in rhesus macaques. a, AZD5582 (0.1 mg kg”) was administered 

to healthy rhesus macaques (n = 3) by intravenous infusion. Plasma 
concentrations of AZD5582 (left y axis) are shown for the indicated time points. 
Flow cytometry was used to measure intracellular p100 levels, shown as the 
geometric mean fluorescence intensity (gMFI) inCD4* T cells and plotted as 
the percentage of baseline of p100 gMFI (right y axis). b, Plasma concentrations 
of AZD5582 after one (dark red), three (red), six (pink) or ten (orange) dosesin 
six SIV-infected ART-treated rhesus macaques (RM) and after one dose in three 
uninfected control rhesus macaques (grey). Individual values are shown as 
symbols.c, Western blot analyses of inactive p100 and active p52 forms of NF- 
«B2 inlymph-node mononuclear cells collected 48 h after the third or tenth 
dose of AZD5582 in SIV-infected ART-suppressed rhesus macaques (red; n=3 
for both the 3-dose and 10-dose groups) or at equivalent time points for 


placebo controls (blue; n =2 for both the 3-dose and 10-dose groups). 
Immunoblots are shown in the top panels and densitometry analyses of the 
p52:p100 ratios are shown inthe bottom panels. The line represents the 
median. d, Cryopreserved control rhesus macaque splenocytes were treated 
with the indicated concentrations of AZD5582 for 48 h, then p100/p52 levels 
were analysed by western blotting to measure engagement of the ncNF-KB 
pathway. e, DMSO-normalized densitometric p52:p100 ratio versus the 
AZD5582 concentration. For d, e, the experiments were performed in 
duplicate. f, Cryopreserved rhesus macaque splenocytes were exposed to 
DMSOalone (Untx), 100 nM AZD5582 washed off after 1h and cultured for 47h 
(Pulsed) or continuous 100 nMAZD5582 for 48 h (Cont.), after which the cells 
were studied by western blot for p100 and p52 levels. g, Densitometric 
p52:p100 ratio. For f, g, data represent a single experiment. For gel source data, 
see Supplementary Fig. 1. 
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Extended Data Fig. 5 | Phylogenetic trees based on full env sequencing. AZD5582 treatment (green and blue). All sequences per macaque were 
Plasma virus was sequenced from four time points per macaque (n= 5): near phylogenetically analysed and the resulting phylogenetic trees are shown for 
peak viraemia (2 weeks after infection; red), immediately before ART (8 weeks each macaque. The horizontal bar indicates the genetic distance. nt, 


after infection; orange) and two time points of on-ART viraemia during nucleotide. 
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Extended Data Fig. 6 | SIV DNA levelsin total CD4' T cells and replication- 
competent reservoir size in ART-suppressed SIV-infected rhesus macaques. 
a, Longitudinal assessment of cell-associated SIV DNA levels in total CD4* 
Tcells isolated from peripheral blood and lymph nodes of AZD5582-treated 
(n=12, red) and control (n=4, blue) ART-suppressed SIV-infected rhesus 
macaques. Grey shading represents the period of ART administration. 

b, Comparison of cell-associated SIV DNA levels in total CD4* T cells isolated 
from the lymph nodes and peripheral blood of AZD5582-treated and control 
ART-suppressed SIV-infected rhesus macaques. Total CD4* T cells were 
analysed from AZD5582-treated rhesus macaques 48 h after receiving 3 doses 
(lymph nodes and peripheral blood, n=3) or 10 doses (lymph nodes and 
peripheral blood, n=9) of AZD5582. Total CD4* T cells were analysed from 
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placebo-control rhesus macaques (lymph nodes and peripheral blood, n=4) 
at equivalent time points. Open symbols indicate AZD5582-treated rhesus 
macaques with on-ART viraemia above 60 copies per ml of plasma. Statistical 
significance was determined using atwo-sided Mann-Whitney U-test. 

c, Quantitative viral outgrowth assays were performed for AZD5582-treated 
rhesus macaques 48 h after receiving 3 doses (lymph nodes, n=2; spleen, n=3) 
or 10 doses (lymph nodes, n=3; spleen, n=2) of AZD5582. Quantitative viral 
outgrowth assays were performed from control rhesus macaques (lymph 
nodes and spleen, n= 4) at equivalent time points. Open symbols indicate 
AZD5582-treated rhesus macaques with on-ART viraemia. Statistical 
significance was determined with a two-sided Mann-Whitney U-test. 
Horizontal lines represent the median (b,c). 
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Extended Data Fig. 7| AZD5582-induced gene pathways and genes in SIV- 
infected ART-suppressed rhesus macaques. a, DAVID analysis showing 
pathways significantly enriched in genes differentially expressed after 
AZD5582 treatment relative to baseline in CD4* T cells isolated fromlymph 
nodes (black bars) and peripheral blood (grey bars). n=6 for both lymph nodes 
and peripheral blood; for each, n=3 for 3 doses of AZD5582 and n=3 for 10 
doses of AZD5582. b, Leading edge genes from the ‘hallmark TNF signalling via 
NF-kB’ pathway from MSigDB. Genes were identified in the leading edge of 
CD4* T cell samples from the lymph nodes before and after treatment with 
AZD5582 (shownin Fig. 4b). The contrast depicted is the fold change of each 
gene for each rhesus macaque’s post-treatment sample relative to the pre- 
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treatment values for CD4* T cells from the lymph nodes (top) and the 
peripheral blood (bottom). n=6 for both lymph nodes and peripheral blood; 
for each, n=3 for 3 doses of AZD5582 and n=3 for 10 doses of AZD5582.c¢, Heat 
map of cNF-KB (top) and ncNF-kB (bottom) pathway gene expression in rhesus 
macaques with (left, n=5) or without (right, n= 6) on-ART viraemia of 

>60 copies per ml plasma. One rhesus macaque without on-ART viraemia was 
excluded from this analysis because of technical issues (higher than expected 
unmapped and multi-mapped reads, and lower than expected unique 
identified reads compared to the means). Colour scale, log-transformed fold 
changes of post-treatment compared with pre-treatment values. 
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Extended Data Fig. 8 | Relationship between virological status before 
intervention and virological response to AZD5582 treatment in SIV- 
infected ART-suppressed rhesus macaques. a, Individual representations of 
plasma SIVRNA levels measured by ultrasensitive assay (limit of detection 

3 copies per ml of plasma) before and during AZD5582 treatmentin SIV- 
infected ART-suppressed rhesus macaques (n= 12). ‘Baseline’ shows two or 
three plasma viral loads before AZD5582 treatment and ‘AZD5582’ shows three 
or four plasma viral loads during AZD5582 treatment. Opensymbols indicate 
the five AZD5582-treated rhesus macaques with on-ART viraemia of >60 copies 
of SIVRNA per ml of plasma using the standard viral load assay (confirmed with 
ultrasensitive assay). The orange box highlights an additional 3 rhesus 
macaques who did not experience on-ART viraemia of >60 copies per ml of 
plasma but had =>2 SIVRNA measurements above baseline values using the 
ultrasensitive quantification method. b, Comparison of the medians of plasma 
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SIVRNA levels at baseline and during AZD5582 treatment measured by 
ultrasensitive assay for ART-suppressed SIV-infected rhesus macaques (n=12). 
Statistical significance was determined with a Wilcoxon matched-pairs signed- 
rank test. c, Comparison of the levels of plasma SIV RNA at peak, plasma SIV 
RNA before ART initiation, SIV DNA in peripheral-blood CD4' T cells before 
AZD5582 treatment (pre-LRA), SIV DNA in lymph-node CD4* T cells pre-LRA, 
SIVRNA in peripheral blood CD4* T cells pre-LRA and SIV RNA in lymph-node 
CD4*T cells pre-LRA in rhesus macaques with increased plasma viral loads 
(PVL) by standard and/or ultrasensitive assay during AZD5582 treatment 
(increased plasma viral loads, n=8) compared with rhesus macaques that did 
not demonstrate increased viral loads (stable plasma viral loads, n= 4). 
Statistical significance was determined using a two-sided Mann-Whitney 
U-test. Horizontal lines represent the median (a, c). 
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Extended Data Fig. 9 | AZD5582 can be safely administered in SIV-infected controls (blue, n= 9). Grey shading represents the period of ART 
ART-suppressed rhesus macaques. a-c, Longitudinal assessment of serum administration. AST, aspartate aminotransferase; ALT, alanine 
chemistries (a), complete blood counts (b) and weight (c) of SIV-infected ART- aminotransferase; BUN, blood urea nitrogen; GGT, y-glutamyltransferase; 


treated rhesus macaques treated with AZD5582 (red, n=12) compared with HGB, haemoglobin; WBC, white blood cell. 


Article 


a h 
2.000. PMA/ionomycin PMA/ionomycin = 104 
d + AZD5582 = 
= D104 
= & 
1,500 >. T™! 21 
8 2 io! 
31,000. 7 0 7 14 21 28 35 42 49 56 63 
Ki 104 Days relative to dose 1 
% 500 E 
[=) D> 
° rT & 
0 na ee ee eee z 
-14 0 14 28 42 56 14 0 14 28 42 56 Arc legend | Functions = 
Days relative to dose 1 Days relative to dose 1 m2 O3 7 0 7 14 21 28 35 42 49 56 63 
+ Days relative to dose 1 
c d Grn- O2 ae | 
Jame eetententens a a a tmnrat | @1 E 
8 
2 g : ‘ a 
x PMA/ionomycin Oo 
2 = 
a 10 0 7 14 21 28 35 42 49 56 63 
pd 5] 96% Days relative to dose 1 
4 8) 10 
8 g 
¢| 10 PP oJ es 
: CE ISS 
—— = Se S| 0 es Sia Bem: 
14° 0 14 28 42 56 14 0 14 28 42 56 o 7 «4-1 98 35 «42 -640~66 G3 
Days relative to dose 1 Days relative to dose 1 0 10° 104 10° Days relative to dose 1 
G Naive cm Bem CellTrace Violet € 10% 
miscm tm a p= 0.0625 ERA 
= 10 210 
Py = ve 5 = LLOD 
gag peptides env peptides 2 oye = 401 
500 ii 50 j 3 80 A a 7 0 7 14 21 28 35 42 49 56 63 
7) H i ow E * Days relative to dose 1 
8 400 @ 40 oy 
= = 8 60 x AZD5582 group 
2 300 2 30 2 Dose 3 group Dose 10 group 
E E 2 40 @RFKI6 = =© RDM16 @ RQsi6 *& RYs16 
§ 200 S 20 : @RNPIG «=A RDG RANG F RKI1G 
2 400 é 2 10 ii 3 20 & RLuI6 #e RKni6 ©& RLy15 @® RKpi6 
a 2 a ZENS z Control group 
0 a : o - Z =". 6 ? @RAN6 =H RKwi6 @ RQG6 
42-28 -14 0 14 28 42 56 42-28 -14 0 14 28 42 56 HERDHG ge RUSIG FFRFVIS py agT 
Days relative to dose 1 Days relative to dose 1 sh RNQ16 —-@ RKZ15_ + RZe15 
Extended Data Fig. 10 | Immunological effect of AZD5582 on SIV-infected (left) or presence (right) of AZD5582 pre-treatment. g, Memory CD8°T cell 
ART-suppressed rhesus macaques. a,b, Longitudinalassessmentofthecount proliferative response to stimulation with PMA and ionomycin with or without 
(a) and the frequency and viability (b) of CD4* T cells in SIV-infected ART- AZD5582 pre-treatment. Top, representative flow cytometry dot plots gated 
treated rhesus macaques during the period of AZD5582 treatment (n=12). onmemory CD8* T cells. Bottom, comparison between divided cells 5 days 
Lines represent median values. c,d, CD4’ T cell naive and memory subset after stimulation in each group (n=5 for each). Statistical significance was 
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rhesus macaques during the period of AZD5582 treatment (n=12). Lines represent the median. h, Longitudinal assessment of plasma levels of IP-10, 


represent median values. e, SIVgag-specific (left) and SIV env-specific (right) MIP-1B, MCP-1, IL-6 and IL-10 by multiplex assay in SIV-infected ART-suppressed 
CD8‘IFNy* T cell responses in SIV-infected, ART-suppressed, AZD5582-treated rhesus macaques treated with AZD5582 (red, n= 6) or control rhesus macaques 
(red, n=6) and control (blue, n=4) rhesus macaques. SFU, spot-forming units. (blue, n=4). An additional four analytes (IFNy, IL-8, IL-1B and IL-2) were 

f, Pie charts depicting the ability of memory CD8'T cells isolated from SIV- undetectable in all macaques. a-e, h, Grey shading represents the period of 
infected ART-suppressed control rhesus macaques (n=5) to produce IFNy, IL-2 ART administration. Dashed lines represent AZD5582 or placebo infusions. 
and/or TNF in response to stimulation with PMA and ionomycin inthe absence LLOD, lower limit of detection. 
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Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
Lo AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
— Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 
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Sample size 


Data exclusions 


Replication 


Randomization 


Blinding 


For BLT mouse studies, no statistical methods were used to predetermine sample size. At least 3 animals were used for each experimental 
group, the minimum to achieve statistical significance. 

Based on our previous data on SIV-infected ART-treated rhesus macaques, with a sample size of at least 7, we would be able to detect a 
significant difference between pre- and post- AZD5582 treatment samples in the levels of plasma RNA at the 0.05 significance level with a 


power of 0.90. 


Data exclusion was applied only to one RNAseq analysis represented on the heatmap of extended data Figure 7. One RM without on-ART 
viremia was excluded from this analysis for technical issues (higher than expected unmapped and multi-mapped reads, and lower than 
expected unique identified reads compared to the means). 


n Figure 1b, symbols represent technical replicates of DMSO-normalized reporter signal induced by a dose titration of a panel of mono- and 
bivalent SMACm in a Jurkat luciferase reporter model of HIV-1 latency with 48 h exposure. In Extended Data Fig. 1f, fold induction of ncNF-kB 
target gene expression was measured by quantitative RT-PCR. Points represent two technical replicates. The data presented are 
representative of three independent experiments. In Extended Data Fig. 1g, for DMSO-normalized induction of luciferase activity from the 
urkat reporter model after exposure to AZD5582, points represent three replicates in one assay run, representative of several independent 


experiments. All attempts at replication were successful. In two independent experiments, plasma viremia was observed following AZD5582 
administration to HIV-infected, ART-suppressed BLT mice. The use of non-human primates precludes our ability to replicate experiments. 
Sample sizes were chosen to maximize the likelihood of detecting statistical differences. 


For the study that examined the impact of AZD5582 administration on plasma and tissue viremia in BLT mice during ART suppression, mice 
were randomized for assignment to either experimental or control groups using randomization software available at random.org. For RM 
studies, peak plasma viral load (measured by standard assay) and plasma viral load before LRA intervention (as measured by ultrasensitive 
assay) were controlled for when allocated animals into experimental groups. 


Investigators were not blinded to group allocations or when assessing outcomes. In some instances, cells were pooled from individual 
humanized mice for each tissue and experimental group for the isolation of resting CD4+ T cells (Fig. 2). 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
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Antibodies used 


rat anti-mouse CD24-biotin (clone M1/69), BD Biosciences (Cat. #557436), 10ug biotin-labeled rat anti-mouse CD24 antibody 
was adsorbed to 1mg streptavidin-labeled magnetic Dynabeads ,https://www.bdbiosciences.com/us/applications/research/ 
stem-cell-research/cancer-research/mouse/purified-rat-anti-mouse-cd24-m169/p/557436 

anti-clAP1-unconjugated (clone EPR4673), Abcam (Cat. # 108361), 1:1,000 dilution, https://www.abcam.com/ciap1-antibody- 


epr4673-ab108361.html 


anti-p100/p52-unconjugated (clone 18D10), Cell Signaling Technology (Cat. # 3017), 1:1,000 dilution, https:// 
www.cellsignal.com/products/primary-antibodies/nf-kb2-p100-p52-18d10-rabbit-mab/3017 

anti-IkBa-unconjugated (clone 44D4), Cell Signaling Technology (Cat. # 4812), 1:1,000 dilution, https://www.cellsignal.com/ 
products/primary-antibodies/ikba-44d4-rabbit-mab/4812 
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anti-clAP2-unconjugated (clone E40), Abcam (Cat. # ab32059), 1:1,000 dilution, https://www.abcam.com/ciap2-antibody-e40- 


ab32059.html 
anti-beta-actin-HR 


P (clone AC-15 ), Abcam (Cat. # ab49900), 1:30,000 dilution, https://www.abcam.com/beta-actin-antibody- 


ac-15-hrp-ab49900.html 
anti-clAP1-unconjugated (goat polyclonal IgG), R&D Systems (Cat. # AF8181, Lot # KH50516111), 10 ug/ml dilution, https:// 
www.rndsystems.com/products/human-ciap-1-hiap-2-antibody_af8181 


anti-CD3-BV421 (c 


one SP34-2), BD Biosciences (Cat. # 562877), 1:250 dilution, https://www.bdbiosciences.com/us/reagents/ 


research/antibodies-buffers/immunology-reagents/anti-human-antibodies/cell-surface-antigens/bv421-mouse-anti-human-cd3- 


5p34-2/p/562877 

anti-CD16-BV605 ( 
research/stem-cel 
anti-CD4-BV711 (c 


clone 3G8), BD Biosciences (Cat. # 563172), 1:50 dilution, https://www.bdbiosciences.com/us/applications/ 
-research/cancer-research/human/bv605-mouse-anti-human-cd16-3g8/p/563172 
one L200), BD Biosciences (Cat. # 563913), 1:50 dilution, https://www.bdbiosciences.com/us/reagents/ 


research/antibodies-buffers/immunology-reagents/anti-non-human-primate-antibodies/cell-surface-antigens/bv7 11-mouse- 


anti-human-cd4-|2: 
anti-CD14-BV786 ( 


00/p/563913 
clone M5E2), BD Biosciences (Cat. # 563698), 1:50 dilution, https://www.bdbiosciences.com/us/applications/ 


research/stem-cel 
p/563698 


anti-CD123-PerCP- 


applications/resea 


-research/hematopoietic-stem-cell-markers/human/negative-markers/bv786-mouse-anti-human-cd14-m5e2/ 


Cy5.5 (clone 7G3), BD Biosciences (Cat. #558714), 1:25 dilution, https://www.bdbiosciences.com/us/ 
rch/b-cell-research/surface-markers/human/percp-cy55-mouse-anti-human-cd123-7g3/p/558714 


anti-CD20-PE-CF594 (clone 2H7), BD Biosciences (Cat. # 562295), 1:500 dilution, https://www.bdbiosciences.com/us/ 


applications/resea 


rch/stem-cell-research/hematopoietic-stem-cell-markers/human/negative-markers/pe-cf594-mouse-anti- 


human-cd20-2h7/p/562295 

anti-CD8-PE-Cy7 (clone SK1), BD Biosciences (Cat. #335787), 1:500 dilution, https://www.bdbiosciences.com/us/reagents/ 
research/clinical-research---ruo-gmp/single-color-antibodies/pe-cytrade7-mouse-anti-human-cd8-sk1/p/335787 
anti-CD11c-Alexa700 (clone 3.9), Ebioscience (Cat. # 50-112-9413), 1:50 dilution, https://www.thermofisher.com/antibody/ 


product/CD11c-An 


tibody-clone-3-9-Monoclonal/56-0116-42 


anti-HLA-DR-APC-Cy7 (clone L243), BD Biosciences (Cat. # 335796), 1:50 dilution, https://www.bdbiosciences.com/us/ 


applications/resea 


rch/stem-cell-research/mesenchymal-stem-cell-markers-bone-marrow/human/negative-markers/apc- 


cytrade7-mouse-anti-human-hla-dr-1243/p/335796 


anti-p100-unconju 


gated (clone EPR18756), Abcam (Cat. # ab191594), 1:25 dilution, https://www.abcam.com/nfkb-p100nfkb2- 


antibody-epr18756-ab191594.html 
anti-CD45-APC (clone HI30), BD Biosciences (Cat. #555485), 3 ul/test, https://www.bdbiosciences.com/us/applications/ 


research/stem-cell 


-research/cancer-research/human/apc-mouse-anti-human-cd45-hi30/p/555485 


anti-CD3-FITC (clone HIT3a), BD Biosciences (Cat. # 555339), 3 ul/test, https://www.bdbiosciences.com/us/applications/ 
research/t-cell-immunology/th-1-cells/surface-markers/human/fitc-mouse-anti-human-cd3-hit3a/p/555339 


anti-CD19-PE (clon 


e HIB19), BD Biosciences (Cat. #555413), 3 ul/test, https://www.bdbiosciences.com/us/applications/ 


research/clinical-research/oncology-research/blood-cell-disorders/surface-markers/human/pe-mouse-anti-human-cd19-hib19/ 


p/555413 


anti-CD4-PerCP (clone SK3), BD Biosciences (Cat. # 347324), 3 ul/test, https://www.bdbiosciences.com/us/applications/research/ 
t-cell-immunology/th-1-cells/surface-markers/human/percp-mouse-anti-human-cd4-sk3-also-known-as-leu3a/p/347324 


anti-CD4-PE (clone 


RPA-T4), BD Biosciences (Cat. #555347), 3 ul/test, https://www.bdbiosciences.com/us/applications/research/ 


t-cell-immunology/th-1-cells/surface-markers/human/pe-mouse-anti-human-cd4-rpa-t4/p/555347 
anti-CD8-PerCP (clone SK1), BD Biosciences (Cat. # 347314), 3 ul/test, https://www.bdbiosciences.com/us/reagents/research/ 


clinical-research--- 


ruo-gmp/single-color-antibodies/percp-mouse-anti-human-cd8-sk1/p/347314 


anti-CD45-APC-Cy7 (clone 2D1), BD Biosciences (Cat. # 557833), 3 ul/test, https://www.bdbiosciences.com/us/applications/ 


research/stem-cell 


-research/cancer-research/human/apc-cy7-mouse-anti-human-cd45-2d1/p/557833 


anti-CD3-PE-Cy7 (clone SK7), BD Biosciences (Cat. #557851), 3 ul/test, https://www.bdbiosciences.com/us/applications/ 
research/t-cell-immunology/th-1-cells/surface-markers/human/pe-cy7-mouse-anti-human-cd3-sk7-also-known-as-leu-4/ 


p/557851 


anti-CD8-FITC (clone SK1), BD Biosciences (Cat. # 340692), 3 ul/test, https://www.bdbiosciences.com/us/applications/clinical/ 
blood-cell-disorders/asr-reagents/cd8-fitc-sk1/p/340692 

anti-CD38-APC (clone HB7), BD Biosciences (Cat. # 340439), 3 ul/test, https://www.bdbiosciences.com/us/applications/research/ 
t-cell-immunology/regulatory-t-cells/surface-markers/human/apc-mouse-anti-human-cd38-hb7/p/340439 

anti-HLA-DR-PE (clone TU36), BD Biosciences (Cat. #555561), 3 ul/test, https://www.bdbiosciences.com/us/applications/ 


research/stem-cel 


-research/mesenchymal-stem-cell-markers-bone-marrow/human/negative-markers/pe-mouse-anti-human- 


hla-dr-tu36-also-known-as-t36-t36/p/555561 


anti-CD3-BV421 (c 


one UCHT1), BD Biosciences (Cat. # 562426), 3 ul/test, https://www.bdbiosciences.com/us/applications/ 


research/t-cell-immunology/th-1-cells/surface-markers/human/bv421-mouse-anti-human-cd3-ucht1-also-known-as-ucht-1- 


ucht-1/p/562426 


anti-HLA-DR-PerCP (clone L243), BD Biosciences (Cat. # 347364), 3 ul/test, https://www.bdbiosciences.com/us/applications/ 


research/stem-cel 


-research/mesenchymal-stem-cell-markers-bone-marrow/human/negative-markers/percp-mouse-anti- 


human-hla-dr-l243/p/347364 


anti-CD4-BV605 (c 


one RPA-T4), BD Biosciences (Cat. #562658), 3 ul/test, https://www.bdbiosciences.com/us/applications/ 


research/t-cell-immunology/th-1-cells/surface-markers/human/bv605-mouse-anti-human-cd4-rpa-t4/p/562658 


anti-CD8-APC-Cy7 


clone SK1), BD Biosciences (Cat. # 557834), 3 ul/test, https://www.bdbiosciences.com/us/reagents/research/ 


antibodies-buffers/immunology-reagents/anti-human-antibodies/cell-surface-antigens/apc-cy7-mouse-anti-human-cd8-sk1/ 


p/557834 


anti-CD25-APC (clone 2A3), BD Biosciences (Cat. # 340938), 3 ul/test, https://www.bdbiosciences.com/us/applications/clinical/ 
blood-cell-disorders/asr-reagents/cd25-apc-2a3/p/340938 
anti-CD45-V500 (clone H130), BD Biosciences (Cat. #560777), 3 ul/test, https://www.bdbiosciences.com/us/applications/ 


research/stem-cell 


-research/cancer-research/human/v500-mouse-anti-human-cd45-hi30/p/560777 


anti-mouse IgG1k-APC (clone MOPC-21), BD Biosciences (Cat. #555751), 3 ul/test, https://www.bdbiosciences.com/us/reagents/ 
research/antibodies-buffers/immunology-reagents/anti-human-antibodies/cell-surface-antigens/apc-mouse-igg1-isotype- 
control-mopc-21/p/555751 

anti-mouse IgG2ak-PerCP (clone X39), BD Biosciences (Cat. # 340765), 3 ul/test, https://www.bdbiosciences.com/us/ 
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applications/clinical/blood-cell-disorders/asr-reagents/mouse-iggsub2asub-percp-x39/p/340765 

anti-mouse IgG1k-PE (clone MOPC-21), BD Biosciences (Cat. #559320), 3 ul/test, https://www.bdbiosciences.com/us/ 
applications/research/intracellular-flow/intracellular-antibodies-and-isotype-controls/anti-human-antibodies/pe-mouse-igg1- 
isotype-control-mopc-21/p/559320 

anti-mouse IgG1k-PE-Cy7 (clone MOPC-21), BD Biosciences (Cat. #557872), 3 ul/test, https://www.bdbiosciences.com/us/ 
reagents/research/antibodies-buffers/immunology-reagents/anti-human-antibodies/cell-surface-antigens/pe-cy7-mouse-igg1- 
isotype-control-mopc-21/p/557872 

anti-mouse IgG2ak-FITC (clone G155-178), BD Biosciences (Cat. #553456), 3 ul/test, https://www.bdbiosciences.com/us/ 
reagents/research/antibodies-buffers/immunology-reagents/anti-mouse-antibodies/cell-surface-antigens/fitc-mouse-igg2a- 
isotype-control-g155-178/p/553456 

anti-CD3-APC-Cy7 (clone SP34-2), BD Biosciences (Cat. #557757), 
https://www.bdbiosciences.com/us/reagents/research/antibodies-buffers/immunology-reagents/anti-non-human-primate- 
antibodies/cell-surface-antigens/apc-cy7-mouse-anti-human-cd3-sp34-2/p/557757 

anti-Ki-67-AF700 (clone B56), BD Biosciences (Cat. #561277), https://www.bdbiosciences.com/us/applications/research/ 
intracellular-flow/intracellular-antibodies-and-isotype-controls/anti-human-antibodies/alexa-fluor-700-mouse-anti-ki-67-b56/ 
p/561277 

anti-HLA-DR-PerCP-Cy5.5 (clone G46-6), BD Biosciences (Cat. #560652), https://www.bdbiosciences.com/us/applications/ 
research/stem-cell-research/mesenchymal-stem-cell-markers-bone-marrow/human/negative-markers/percp-cy55-mouse-anti- 
human-hla-dr-g46-6/p/560652 

anti-CCR5-APC (clone 3A9), BD Biosciences (Cat. # 550856), https://www.bdbiosciences.com/us/applications/research/t-cell- 
immunology/th-1-cells/surface-markers/human/apc-mouse-anti-human-cd195-3a9/p/550856 

anti-CD8-BV711 (clone RPA-T8), BD Biosciences (Cat. # 563677), https://www.bdbiosciences.com/us/reagents/research/ 
antibodies-buffers/immunology-reagents/anti-human-antibodies/cell-surface-antigens/bv711-mouse-anti-human-cd8-rpa-t8/ 
p/563677 
anti-CD4-BV650 (clone OKT4), Biolegend (Cat. # 317436), https://www.biolegend.com/en-us/products/brilliant-violet-650-anti- 
human-cd4-antibody-7786 

anti-PD-1-BV421 (clone EH12.2H7), Biolegend (Cat. # 329920), https://www.biolegend.com/en-us/products/brilliant-violet-421- 
anti-human-cd279-pd-1-antibody-7191 

anti-CD3-AF700 (clone SP34-2), BD Bioscience (Cat. #557917), https://www.bdbiosciences.com/us/reagents/research/ 
antibodies-buffers/immunology-reagents/anti-non-human-primate-antibodies/cell-surface-antigens/alexa-fluor-700-mouse-anti- 
human-cd3-sp34-2/p/557917 
anti-CD69-PE-CF594 (clone FN50), BD Bioscience (Cat. # 562617), https://www.bdbiosciences.com/us/applications/research/t- 
cell-immunology/regulatory-t-cells/surface-markers/human/pe-cf594-mouse-anti-human-cd69-fn50-also-known-as-fn-50/ 
p/562617 
anti-CD25-PE-Cy7 (clone BC96), Biolegend (Cat. # 302612), https://www.biolegend.com/en-us/products/pe-cy7-anti-human- 
cd25-antibody-1909 
anti-CD45RA-PE-Cy7 (clone5H9), BD Biosciences (Cat. # 561216), https://www.bdbiosciences.com/us/applications/research/b- 
cell-research/surface-markers/non-human-primates/pe-cy7-mouse-anti-human-cd45ra-5h9/p/561216 

anti-CD62L-PE (clone SKII), BD Biosciences (Cat. # 654666), https://www.bdbiosciences.com/us/applications/clinical/blood-cell- 
disorders/asr-reagents/cd62|-pe-sk11-also-known-as-anti-leu-8/p/654666 

anti-CD95-BV605 (clone DX2), Biolegend (Cat. # 305627), https://www.biolegend.com/en-us/products/brilliant-violet-605-anti- 
human-cd95-fas-antibody-8778 

anti-CD28-PE-Cy5.5 (clone CD28.2), Beckman Coulter (Cat. # B24027), https://www.beckman.com/reagents/coulter-flow- 
cytometry/antibodies-and-kits/single-color-antibodies/cd28/b24027 
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Validation The specificity of the antibodies purchased from commercial sources (BD Biosciences, Abcam, Cell Signaling Technology, R&D 
Systems, Ebioscience, Biolegend, and Beckman Coulter) were validated by the manufacturer as noted on their website (links 
provided above for each antibody). 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) Jurkat Clone E6-1 cells (American Type Culture Collection TIB-152), TZM-bl cells (NIH AIDS Reagent Repository) and HEK 293T 
cells (European Collection of Authenticated Cell Cultures) 


Authentication Cell lines were authenticated by morphological identification and virus susceptibility profiles. 
Mycoplasma contamination Cell lines were tested negative for mycoplasma by the supplier 


Commonly misidentified lines No commonly misidentified cell lines were used 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals BLT mice were constructed using 12-15 week old female NOD.Cg-Prkdcscid Il2rgtm1Wjl/SzJ mice (NSG; The Jackson Laboratory, 
Bar Harbor, ME) mice. Female 20 week old BALB/cJ (The Jackson Laboratory, Bar Harbor, ME) were used for the serum chemistry 
analysis. Three healthy male rhesus macaques (Macaca mulatta) of Indian origin, age 6-7 years, were utilized for the AZD5582 
pharmacokinetic study. Twenty-one male and female Mamu-B*08 and —B*17 negative rhesus macaques , age 3-6 years, were 
infected with SIVmac239 and treated with ART (Supplementary Table 7). 


Wild animals The study did not involve wild animals. 
Field-collected samples The study did not involve samples collected from the field. 
Ethics oversight ice were maintained under specific pathogen-free conditions by the Division of Comparative Medicine at the University of 


orth Carolina, Chapel Hill. Mouse experiments were conducted in accordance with NIH guidelines for the housing and care of 
aboratory animals and in accordance with protocols reviewed and approved by the Institutional Animal Care and Use 
Committee at the University of North Carolina, Chapel Hill. Healthy Rhesus macaques for pharmacokinetic studies were housed 
at GlaxoSmithKline and all procedures were conducted in accordance with the GlaxoSmithKline Policy on the Care, Welfare, and 
Treatment of Laboratory Animals and were reviewed by the IACUC at GlaxoSmithKline. Rhesus macaques infected with SIV were 
housed at the Yerkes National Primate Research Center (Atlanta, GA) and treated in accordance with Emory University and 
Yerkes National Primate Research Center Institutional Animal Care and Use Committee regulations. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Human research participants 
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Policy information about studies involving human research participants 


Population characteristics Cells used in the QVOA and RNAseq experiments were obtained from participants stably suppressed on ART. At the time of 
sample donation, participants had a mean age of 43 years [range, 26-61 years], a mean duration on ART of 7 years [range, >6 
months -22 years] and a mean CD4 count of 634 [range, 372-1364 cells/ul]. All participants were male , and 88% were 
Caucasians and 12% African American. Twenty-five percent of the participants were treated during acute infection and 75% 
during chronic infection. 


Recruitment Cells used in the QVOA assays were selected randomly across participants enrolled in a longitudinal reservoir measurement 
study and thus should not be subjected to self-selection bias. For the RNA seq experiments, cells from participants with a 
demonstrated increase in cell associated HIV RNA following ex-vivo exposure to AZD5582 were selected, potentially introducing 
a self-selection bias. However, given that these were global human gene expression measurements, we do not believe our 
results were impacted by this bias. 


Ethics oversight All human subjects samples were obtained under a specimen procurement protocol reviewed and approved by the University of 
North Carolina Biomedical Institutional Review Board and the McGill University Health Centre Ethical Review Board. Informed 
consent was obtained from all participants. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a 'group' is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 


Sample preparation Sample preparation for the flow cytometric analysis of peripheral blood and tissues from humanized mice and rhesus macaques 
is detailed in the Methods section. 


Instrument Flow cytometry data was collected on BD LSRII, BD LSRFortessa, BD FACSAria LSR II, or BD FACSCanto instruments using BD 
FACSDiva software. 


Software Flow cytometry data was analyzed with FlowJo software. 


Cell population abundance _ Resting CD4+ T cells represented 0.12-9.48% (mean: 3.42%) of the total cell population. Prior to sorting of macaque resting CD4+ 
T cells by FACS, CD4+ T cells were enriched by magnetic bead selection. Post-sort purity was 97.8%. 


Gating strategy For the analysis of the frequency and phenotype of different human immune cell populations in the peripheral blood and tissues 
of BLT mice, an antibody specific for human CD45, a pan leukocyte marker, was used first to gate human leukocytes. Gates to 
define positive and negative populations were defined by isotype controls when appropriate. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 
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Glioblastoma is a universally lethal form of brain cancer that exhibits an array of 
pathophysiological phenotypes, many of which are mediated by interactions with the 
neuronal microenvironment’”. Recent studies have shown that increases in neuronal 
activity have an important role in the proliferation and progression of glioblastoma**. 


Whether there is reciprocal crosstalk between glioblastoma and neurons remains 
poorly defined, as the mechanisms that underlie how these tumours remodel the 
neuronal milieu towards increased activity are unknown. Here, using a native mouse 
model of glioblastoma, we develop a high-throughput in vivo screening platform and 
discover several driver variants of PIK3CA. We show that tumours driven by these 
variants have divergent molecular properties that manifest in selective initiation of 
brain hyperexcitability and remodelling of the synaptic constituency. Furthermore, 
secreted members of the glypican (GPC) family are selectively expressed in these 
tumours, and GPC3 drives gliomagenesis and hyperexcitability. Together, our studies 
illustrate the importance of functionally interrogating diverse tumour phenotypes 
driven by individual, yet related, variants and reveal how glioblastoma alters the 
neuronal microenvironment. 


Inthe pursuit of cataloguing the spectrum of somatic mutations for all 
tumour types, cancer genomics is rapidly approaching a state of satura- 
tion mutagenesis®. These crucial advances promise to usher inanewera 
of personalized medicine, in which patient-specific genomic informa- 
tionis applied in the clinic. One major obstacle in achieving this goal is 
decoding driver and passenger mutations from these cohorts, as not 
all mutations affect malignancy, whereas others exert context-specific 
functions. Mutation ‘hotspots’ represent variants in key driver genes 
that occur with high frequency and are found across different types of 
cancer®. Although widely accepted as the convention for prospective 
selection of driver mutations, this approach overlooks most mutations 
and, crucially, does not account for how cellular context influences vari- 
ant function’ ’. Whether related variants exert differential effects on 
both tumour and microenvironmental phenotypes remains unknown. 
These limitations illustrate the need for high-throughput, functional 
and phenotypic screening of individual variant cohorts in contextually 
appropriate in vivo systems. 


Identifying PIK3CA driver variants 

The RTK-RAS-PI3K pathway is a key driver of tumorigenesis across all 
cancers, with 90% of glioblastoma tumours exhibiting alterations in this 
pathway””*. Among the specific genes in this pathway, mutations in the 


PI3K catalytic subunit PIK3CA are found in 11% of glioblastoma tumours”. 
Sequencing of glioblastoma samples revealed several known hotspot 
mutations that drive tumorigenesis in several cancer lineages (E545K and 
H1047R), as well as a series of 63 in-frame mutations (as indexed in 
the Catalogue Of Somatic Mutations In Cancer (COSMIC)) that remain 
largely unclassified (Fig. 1a, Supplementary Table 1). To decode which 
of these PIK3CA variants function as drivers of glioblastoma, we estab- 
lished anin vivo complementation screening platform for glioblastoma. 
Our mouse glioblastoma model relies on in utero electroporation (IUE) 
and CRISPR-Cas9-mediated knockout of Nf1, 7rp53 (also known as p53) 
and Pten (termed ‘3xCr’), with 50% of the mice succumbing to tumours 
by postnatal day (P) 84% (Fig. 1d, black line, Extended Data Table 1). 
Removing the Pten guide RNA (termed ‘2xCr’) extends median survival 
to P203”, providing ample dynamic range to screen for factors that 
complement the loss of PTEN (Fig. 1d, grey line, Extended Data Table 1). 
Because PTEN catalyses the reverse reaction of PIK3CA—phosphoryla- 
tion of phosphatidylinositol-4,5-bisphosphate (PtdIns(4,5)P,, or PIP,) to 
phosphatidylinositol-3,4,5-trisphosphate (PtdIns(3,4,5)P3, or PIP;)—we 
complemented Pten loss with overexpression of E545K or H1047R—two 
bona fide hotspot driver mutations of PIK3CA™. Both of these variants 
accelerated tumour growth, which demonstrates that known PIK3CA 
drivers complement PTEN loss in our system (Fig. 1d, red and green lines, 
Extended Data Fig. 1a, b, Extended Data Table 1). 
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Fig. 1|In vivo screening identifies novel driver PIK3CA variants in glioma. 
a, Glioblastoma-associated PIK3CA mutations indexed in COSMIC (by June 
2019): missense (green), in-frame deletions and insertions (brown), or 
nonsense (pink) (Supplementary Table 1). b, Schematic of in vivo competition 
assay in which Nf1 and 7rp53 are deleted (2xCr) and 27 uniquely barcoded 
Pik3ca alleles are co-electroporated.c, Next-generation sequencing for 
barcode amplification, showing the barcode for eachallele (red) and the input 
signal (black). n=3 tumours. Data are mean and s.e.m. d, Kaplan-Meier curve of 
all tested variants. n= 42 (3xCr);n=28 (Cherry); n=20 (C420R); n=32 (E545K); 
n=30(H1047R); n=19 (M10431); n=21(R88Q);n=29 (WT); n=20 (V638A); 
n=20(H70I1P). See Extended Data Table 1 for full survival statistics. 


Our finding that established P/K3CA hotspots can drive in vivo tumo- 
rigenesis led us to screen a cohort of PIK3CA variants found in human 
glioblastoma (Fig. 1a, Supplementary Table 1). To achieve this, we used 
DNA barcoding together with a pooled screening strategy’”"* (Methods) 
inour sensitized 2xCr model (Fig. 1b, Extended Data Fig. 2). After glioma 
formation, we used targeted sequencing of tumours for barcodes as 
a surrogate for variant enrichment within the tumours. This analysis 
demonstrated that the established hotspots (E545K and H1047R) are 
enriched along with other variants (R88Q and C420R) that have not 
been characterized in glioma (Fig. 1c). To confirm driver function of 
these variants, we individually expressed them in the 2xCr model, find- 
ing that they all complement the loss of PTEN and drive tumorigenesis, 
while also pathologically resembling glioblastoma (Fig. 1d, Extended 
Data Figs. 1a, b, 3, Extended Data Table 1). These data indicate that our 
approach can identify established hotspots, as well as new PIK3CA 
driver variants in glioma. 


PIK3CA tumours exhibit synaptic profiles 

Although the tested driver variants accelerated tumour-associated 
death compared to the wild-type PIK3CA control, they did soto varying 
degrees (Fig. 1d). To understand a molecular basis for differences in 
tumorigenic potential between these variants, we performed reverse- 
phase protein array (RPPA) analysis on lysates collected from tumours 
driven by each variant, finding differential activation of PI3K pathway 
components (Fig. 2a). C420R and H1047R demonstrated the high- 
est PI3K activity and were also the strongest drivers of tumorigenesis 
(Fig. 1d), whereas E545K, R88Q and M1043] exhibited less activation. 
These differences in PI3K tone and survival suggest that individual 
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Fig. 2| Tumours driven by PIK3CA variants exhibit diverse molecular 
properties. a, Tumour lysate RPPA for proteins of the PISK-AKT-MTOR 
pathway”; n=2 per variant. b, Principal component (PC) analysis against top 
2,000 variable genes across variant tumours from RNA-seq data. n= 2 (normal 
brain); n=4 (C420R); all other variants n=3.c, Enrichment analysis of the top 
genes increased or decreased in C420R and H1047R variants (P< 0.01 for both 
wild-type and Cherry control comparisons, by linear model) for gene sets 
associated with synapse function or proliferation. Number above the bars 
represent genes from each set involved in observed significant overlap. P 
values were determined by one-sided Fisher’s exact test; n=4 (C420R);n=3 
(H1047R).d, Differential gene expression patterns (relative to average of wild- 
type and Cherry controls) for tumours driven by C420R or H1047R for 
proliferative or synaptic genes. 


variants promote tumorigenesis through distinct mechanisms, which 
we examined via RNA sequencing (RNA-seq) analysis of end-stage 
tumours driven by these variants. Principal component analysis of the 
transcriptomes revealed that tumours driven by E545K, M10431, R88Q, 
wild-type PIK3CA and the 2xCr control are highly correlated (Fig. 2b). 
Notably, principal component analysis revealed that the transcriptomes 
of tumours driven by H1047R and C420R are vastly different from this 
core set of variants and from one another (Fig. 2b, circles). Focusing 
onthe molecular differences manifest in H1047R- and C420R-driven 
tumours, we performed Gene Ontology analysis of the transcriptome 
data, and found that genes associated with proliferation and synapses 
are differentially expressed (Fig. 2c, Supplementary Table 2). Although 
C420R tumours demonstrated a notable increase in proliferative genes, 
we found two distinct patterns of synaptic gene dysregulation, with 
C420R tumours showing suppression of one subgroup of synaptic 
genes and H1047R tumours revealing robust upregulation of an entirely 
different set of synaptic genes (Fig. 2d). Together, these molecular 
analyses reveal that gliomas driven by different PIK3CA variants are 
endowed with distinct molecular features that can be uncoupled from 
PI3K activation (that is, C420R and H1047R). Moreover, these find- 
ings highlight how driver variants endowed with single amino acid 
differences, exhibiting seemingly similar phenotypic effects, can have 
distinct underlying molecular properties. 


Selective induction of hyperexcitability 

Given the distinct molecular features of tumours driven by C420R and 
H1047R mutations, we next determined whether these tumours exhibit 
unique biological properties. Because tumours driven by C420R display 
a pronounced increase in genes associated with proliferation (Fig. 2c, 
d) and cause a shorter median survival time in mice when compared to 
H1047R-driven tumours (44 versus 59 days), we evaluated tumour growth 
and proliferation during early tumorigenesis. Using BrdU labelling and 
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Fig.3|C420R and H1047R tumours promote hyperexcitability and synaptic 
imbalance across tumour models. a, Schematic of experimental plan for IUE 
model. b, EEG traces of select variant tumour mice at P45 and quantification of 
EEG analyses. Traces for C420R and H1047R occurred during seizures. Seizure 
incidence (top) and quantification of spike activity (bottom) are shown; n=4 
mice per variant. Data in seizure incidence graph denote mean values. Box 
plots denote the median value, interquartile range (25th-7Sth percentiles), 
with whiskers extending to 1.5 times the interquartile range. c, Antibody 
staining of excitatory and inhibitory synapses in P30 mouse brains at 
peritumoral margins (top) and quantification from a34,000-um’ field of view 
(bottom). GFP signal (blue, not labelled) denotes tumour. n=3 mice;n=5 


MRI imaging (Extended Data Fig. 1c—e), we found that C420R-driven 
tumours exhibit an enhanced proliferative phenotype compared to 
tumours driven by H1047R and R88Q, thus corroborating the underly- 
ing molecular features of these variants. The other defining molecular 
feature of C420R- and H1047R-driven tumours is dysregulation of distinct 
cohorts of synapse-associated genes (Fig. 2c, d). It is well-established 
that synaptic imbalance can result in extensive changes in network excit- 
ability, which in some cases culminates in seizure activity””°. Crucially, 
seizures are an early pathophysiological hallmark of malignant glioma 
that are recapitulated in our mouse model of glioma”. Therefore, we 
determined whether mice bearing tumours driven by C420R and H1047R 
exhibit pronounced network hyperexcitability during the early stages of 
tumour progression (Fig. 3a). We performed serial video electroencepha- 
lography (EEG) recordings every 5 days during the P30-P60 interval 
(Extended Data Fig. 4), and found that mice bearing tumours driven by 
H1047R show an earlier onset of seizures (50% of the cohort exhibited 
convulsive seizures at P30) than mice bearing tumours driven by C420R 
(seizures first appear at P45) (Fig. 3b). EEG recordings of these mice 
during this interval revealed considerable increases in network hyper- 
excitability, with both groups of mice demonstrating similar increases 
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technical replicates. Box plots asin b. d, Schematic of PDX model experiments. 
e, EEG traces (left) and quantification (right) of mice bearing PDX tumours 
expressing PIK3CA variants five weeks after transplantation. Trace for PDX- 
C420R demonstrates aberrant EEG patterns without associated seizures; trace 
for PDX-H1047R was during an electrographic, non-convulsive seizure. n=4 
mice per variant. Box plots are as inb. f, Antibody staining of excitatory and 
inhibitory synapses from peritumoral margins of PDX brains (top) and 
quantification froma34,000-um’ field of view (bottom).n=3 mice;n=5 
technical repeats. Box plots asin b. Scale bars, 12.5 j1m (c) and 10 pm (f). 
*P<0.05,**P<0.01,***P< 0.001, one-way analysis of variance (ANOVA). 


in spiking frequency as early as P30-P35 (Fig. 3b). By contrast, mice 
bearing R88Q-driven tumours did not demonstrate convulsive seizures 
or changes in EEG activity during the P30-P60 interval (Fig. 3b). These 
variant-specific increases in hyperexcitability correlate with the relative 
changes in synaptic gene cohorts of C420R and H1047R tumours, which 
suggests that tumours driven by these variants result in early synaptic 
imbalance during the formative stages of tumour progression. 

To determine whether C420R and H1047R tumours exhibit changes 
insynaptic phenotypes inthe tumour microenvironment during early 
tumorigenesis, we isolated C420R- and H1047R-driven tumours from 
P30 mice and stained for makers of excitatory (VGLUT1 and PSD95) and 
inhibitory (VGAT and gephyrin) synapses at the peritumoral margins 
(Extended Data Fig. 5). These studies revealed significant increases 
in excitatory synapses at the peritumoral margins of both C420R and 
H1047R tumours compared with R88Q tumours (Fig. 3c). Notably, 
analysis of inhibitory synapses revealed a decrease in their numbers 
surrounding C420R and H1047R tumours, compared to R88Q (Fig. 3c). 
These changes in synaptic constituency—increases in excitatory and 
decreases in inhibitory synapses—in neurons adjacent to C420R- and 
H1047R-driven tumours further indicate that the observed increases 
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Fig. 4|C420R and H1047R differentially promote synaptic imbalance. a, EEG 
traces and quantification of variants overexpressed in mouse brains at P40. 
Trace for H1047R was during seizure. Seizure incidence and quantification of 
total spike activity are shown; n=4 mice per variant. Box plots areas in Fig. 3b. 
Nosignificant (P< 0.05) Pvalues were calculated by one-way ANOVA. 

b, Schematic of astrocyte-neuron co-culture. WT, wild type. c, Top, antibody 
staining for synapsin 1 (red) and MAP2 (green) of astrocyte-neuron co- 
cultures; astrocytes infected with control (ctrl) or C420R or H1047R viruses. 


in network hyperexcitability result, in part, from synaptic imbalance. 
Next, we extended these functional studies to patient-derived xenograft 
models (PDX), by generating glioma stem-cell lines” that individu- 
ally overexpress H1047R, C420R and R88Q (Fig. 3d). These lines were 
subsequently transplanted into the brains of SCID mice, in which we 
assessed the onset of hyperexcitability via EEG, and found that mice 
bearing PDX-H1047R exhibited early onset of seizures whereas mice 
bearing PDX-C420R exhibited early onset of network hyperexcitabil- 
ity (Fig. 3e). PDX-R88Q did not demonstrate early onset of either sei- 
zures or hyperexcitability (Fig. 3e). Moreover, cellular analysis of the 
peritumoral synaptic constituency revealed that both PDX-H1047R 
and PDX-C420R tumours provoke an increase in excitatory synapses, 
coupled witha decrease in inhibitory synapses compared to PDX-R88Q 
tumours (Fig. 3f). These alterations in synaptic constituency are met 
with congruent changes in cell proliferation, as PDX-H1047R and PDX- 
C420R exhibit increased incorporation of BrdU (Extended Data Fig. 1f). 
Together, these functional studies in PDX models reinforce our con- 
clusion that these variants selectively promote changes in network 
hyperexcitability and synaptic constituency. 


Variant-specific synaptogenic mechanisms 


The synaptic changes at the peritumoral margin in both the 
native and PDX models suggest that these variants function via 


Bottom, quantification of PSD95 and synapsin 1 staining. n=12 technical 
replicates per condition. Data are mean ands.e.m. Scale bar, 50 pm. 

d,e, Representative traces (left) and quantification (right) of excitatory (d) 
or inhibitory (e) postsynaptic currents from whole-cell recording of neurons 
co-cultured on astrocytes infected with control, or C420R or H1047R viruses. 
n=12technical replicates per condition. Data are mean and s.e.m.*P< 0.05, 
**P<0.01,***P< 0.001, one-tailed independent t-test; Tukey’s test was used to 
compare individual mean values. 


cell-non-autonomous mechanisms. However, several PIK3CA variants 
are associated with monogenic paediatric overgrowth syndromes and 
can promote hyperexcitability on their own, including H1047R, which 
suggests that they may also function cell autonomously” ~”. To address 
this, we performed individual IVE, outside the tumorigenic context, 
with H1047R, C420R and R88Q, in conjunction with serial EEG record- 
ings every 5 days during the P30-P60 interval (Extended Data Fig. 4). 
Mice expressing H1047R exhibited increased hyperexcitability as early 
as P30, coupled with seizures at P40, whereas mice expressing C420R 
and R88Q did not demonstrate any changes during this period (Fig. 4a); 
we did not observe any changes in cell proliferation (Extended Data 
Fig. 1g-i). These data indicate that H1047R promotes hyperexcitability 
onits own, whereas C420R cannot—which suggests that these variants 
use different mechanisms to alter synaptic constituency. Next, we used 
an established neuron-glial co-culture system”*, which enables us to 
directly assess the cell-non-autonomous contributions of these vari- 
ants to synaptogenesis (Fig. 4b). We overexpressed C420R and H1047R 
in mouse astrocytes co-cultured with neurons, and found that C420R 
promotes synapse formation between neurons as measured by the 
expression of synaptic markers (Fig. 4c). Whole-cell recordings revealed 
increased frequency of excitatory and inhibitory postsynaptic cur- 
rents (EPSCs and IPSCs, respectively) inneurons when co-cultured with 
astrocytes expressing C420R (Fig. 4d, e). By contrast, overexpression 
of H1047R in astrocytes did not enhance synapse formation on neurons 
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Fig. 5| GPC3 promotes gliomagenesis and synaptic imbalance. a, Top, GPC3 
staining of human glioblastoma and healthy human brain control. Staining was 
not independently repeated. Bottom, GPC3 antibody staining of C420R and 
H1047R tumours from P30 mouse brains. n=4 mice for each variant. These 
experiments were independently repeated four times. Scale bars, 50 pm (top) 
and 100 pm (bottom). b, Kaplan-Meier curve comparing GPC3 deletion 
(GPC3“) in 2xCr; C420R tumours. n=20 (C420R) andn=30 (C420R; GPC3“) 
mice. Statistics are in Extended Data Fig. 8b. c, Quantification of total spike 
activity shows that EEG spike activity decreases with GPC3 deletion. n=4 mice 


(Fig. 4d, e). Together, these results illustrate mechanistic differences 
between H1047R and C420R. H1047R promotes brain hyperexcitability 
on its own, through cell-autonomous mechanisms, whereas C420R 
promotes brain hyperexcitability specifically in the context of a glial 
tumour, through cell-non-autonomous mechanisms. 


GPC3 promotes tumorigenesis 

To understand how C420R exerts a cell-non-autonomous effect on 
synaptogenesis, we further analysed the RNA-seq data from the variant- 
driven tumours, focusing ona set of genes that is secreted from gliaand 
promotes synapse formation onto neurons”®. Among this group, we 
found that members of the glypican family are selectively upregulated 
in tumours driven by C420R (Extended Data Fig. 6a). Analysis of The 
Cancer Genome Atlas (TCGA) data for glioma revealed that glypican 3 
(GPC3) is more highly expressed in human glioblastoma tumours than 
in lower-grade glioma (Extended Data Fig. 6b). We confirmed protein 
expression using immunohistochemistry on primary human glioma 
samples and detected increased expression of GPC3 in C420R-driven 
mouse tumours (Fig. 5a, Extended Data Fig. 6c). Together, these data 
suggest that GPC3 may contribute to glioma tumorigenesis and down- 
stream synaptic remodelling. Notably, how secreted synaptogenic 
proteins expressed in glioma influence (1) tumorigenesis, (2) the sur- 
rounding neuronal microenvironment and (3) key neurological features 
of glioma pathophysiology remain largely undefined. Therefore, exam- 
ining the function of GPC3 in this context provides an opportunity to 
understand how these facets of glioma biology are regulated. 

To determine whether GPC3 contributes to C420R-driven tumours, 
we used CRISPR-Cas9 approaches to delete GPC3 (GPC3“) in C420R- 
driven mouse tumours, and found that loss of GPC3 extended median 
survival (Fig. 5b, P44 versus P60, Extended Data Fig. 7). Notably, serial 
EEG recordings of mice bearing 2xCr; C420R; GPC“ tumours revealed a 
loss of early onset hyperexcitability compared to C420R-driven controls 
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per variant. Box plots areas in Fig. 3b. *P< 0.05, **P< 0.01, one-way ANOVA. n=4 
mice each. d, Kaplan-Meier curve comparing Gpc3 overexpression in 3xCr 
tumours. n=55 (3xCr) andn=41 (3xCr; GPC3) mice. Statistics are in Extended 
Data Fig. 8b. e, EEG traces of Gpc3 overexpression in 3xCr tumours at P50. 

f, Quantification of total spike activity (left) and seizure incidence (right). n=4 
mice (3xCr) and n=5 mice (3xCr; GPC3) for spike quantifications; n=4 mice per 
condition for seizure incidence. Box plots areas in Fig. 3b. *P< 0.05, one-way 
ANOVA. 


(Fig. 5c). Having established that GPC3 is necessary for key pathophysi- 
ology properties of C420R-driven tumours, we next evaluated whether 
itis sufficient to drive glioma tumorigenesis independently of PIK3CA 
variants. We combined PiggyBac-mediated Gpc3 overexpression with 
our 3xCr system; survival analysis revealed that mice overexpressing 
Gpc3 exhibited significantly shorter median survival than 3xCr controls 
(P59 versus P91) (Fig. 5d, Extended Data Fig. 7). Next, we examined sei- 
zure phenotypes via video EEG in 3xCr tumours overexpressing Gpc3, 
and found that mice bearing Gpc3-overexpressing tumours exhibited 
early onset of convulsive seizures and heightened network hyperex- 
citability as early as P40 (Fig. 5d-f). Together, these studies indicate 
GPC3 functions as a driver of glioma tumorigenesis that accelerates 
the onset of seizures and hyperexcitability. 

The robust effects of Gpc3 overexpression on these key tumour 
pathophysiological properties led us to examine the cellular pheno- 
types in these tumours. First, we evaluated cellular proliferation, and 
found an increase in BrdU labelling in GPC3-driven tumours (Extended 
Data Fig. 8a). Next, we probed the peritumoral synaptic constituency, 
and identified increases in both excitatory and inhibitory synapses 
in these tumours (Extended Data Fig. 8b), which suggests that GPC3 
promotes the aberrant formation of both excitatory and inhibitory 
synapses. Previous studies have demonstrated that other members 
of the glypican family promote synapse formation onto neurons 
through secretion from adjacent glial cells””*°, suggesting that GPC3 
may act by a similar mechanism. To test this, we overexpressed Gpc3 
in mouse astrocytes co-cultured with neurons and found that Gpc3 
overexpression in astrocytes markedly increases synapse formation 
onto neurons, as measured by expression of synaptic markers and 
whole-cell recordings from these neurons (Extended Data Fig. 9a-d). 
Conditioned medium from these astrocytes promoted heightened 
spontaneous postsynaptic current frequency on neurons (Extended 
Data Fig. 9e-g), which indicates that GPC3 acts via non-cell-autonomous 
mechanisms. These studies identify GPC3 as a key factor in 


glioma-induced neosynaptogenesis that manipulates the neuronal 
microenvironment during tumour progression. 


Discussion 


Decoding variant function in cancer has become an increasingly crucial 
requirement for developing personalized therapeutic agents. In this 
study, we describe an in vivo complementation screening platform that 
enables the rapid identification of driver mutations in a native model 
of glioma. Using this system, we identified several driver variants of 
PIK3CA that are active in glioma. Crucially, a subset of these variants 
(C420R, M1043] and R88Q) appear to function in a glioma-relevant 
manner, as studies in other cancer models did not identify these vari- 
ants, or their relative oncogenic potential varies”””. Together, these 
studies illustrate that variant function is context-specific and highlight 
the importance of decoding cell lineage and genetic relationships’ ™. 
Furthermore, among the PIK3CA variants that operate as drivers in our 
glioma system, transcriptome analysis identified marked molecular 
differences in the tumours driven by these variants. These findings 
demonstrate that similar variants can engender a diverse range of 
molecular properties in glioma tumours, reinforcing the importance 
of deciphering the selective functions of each driver variant. 

Our identification of proliferative and synaptic gene dysregulation 
among tumours driven by H1047R and C420R served as an entry point 
for demonstrating phenotypic differences among these tumours. 
Using both native mouse and PDX models, we found that tumours 
driven by these variants selectively alter the neuronal microenviron- 
ment towards heightened network hyperexcitability and seizure 
onset—two key features of glioma pathophysiology”. Furthermore, 
H1047R and C420R use different mechanisms to engender these com- 
mon physiological phenotypes. Nevertheless, these findings further 
illustrate that crosstalk between glioma tumours and the surrounding 
neuronal microenvironment is a key contributor to malignant growth. 
Several recent studies have shown that increases in neuronal activity 
promotes glioma tumour proliferation’”, and when put together with 
our findings, indicate a vicious cycle between glioma cells and adjacent 
neurons, in which glioma cells promote network hyperexcitability 
and these increases in neuronal activity promote tumour growth. To 
understand how these variant-driven tumours promote neuronal activ- 
ity, we identified GPC3 as a driver of glioma tumorigenesis, network 
hyperexcitability and synapse formation. Because GPC3 promotes 
synapse formation through cell-non-autonomous mechanisms, a 
model emerges in which secreted proteins from glioma stimulates 
neuronal activity by triggering synaptogenesis during early tumori- 
genesis, working together with neurons to orchestrate both network 
hyperexcitability and growth. 
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Methods 


All mouse gliomas were generated in the CD-1IGS mouse background as 
previously described’. IUE was performed onembryonic day 15. Previ- 
ously generated CRISPR constructs were used to knockout Nf1, Pten and 
Trp53 (eachat 1.5 pg per pl). Pik3ca alleles were genomically integrated 
and overexpressed through the piggyBac transposase (PBase) system. 
The pGlast-PBase plasmid (at 2.0 pg per p1l)** was co-electroporated 
witha PBCAG-PIK3CA* construct (1.0 pg per pl) (see below). In addition, 
the PBCAG-GFPt2aLuc (1.0 pg per pl) was co-electroporated to allow 
for fluorescent and bioluminescent visualization. 

In vivo functionalization studies of Gpc3 were performed by co- 
electroporating a PBCAG-GPC3 construct (at 1.0 pg per pl), generated 
through the HiTMMob approach (see below), into the 3xCr model®. 
Loss-of-function studies were performed by co-electroporating a 
CRISPR construct targeting the third exon of the mouse Gpc3 gene. 
The gRNA sequence (5’-CTTGGGT TCTGATATCAACG-3’) was gener- 
ated through Broad Institute GPP portal (https://portals.broadinsti- 
tute.org/gpp/public/analysis-tools/sgrna-design)**. Candidate gRNA 
sequence was validated through the mismatch-cleavage SURVEYOR 
assay (IDT, 706020) on genomic DNA (gDNA) obtained from tumour 
tissue (Extended Data Fig. 7c). Primer sequences for SURVEYOR are 
as follows: ON (ATACCAGCAATGAATGTGAGTCAA and CCACCCACC 
ACCACAATGAAG), OT1(TTGCACAGCATAGCCTGAGA and TACCCTCTA 
CTGCTGACCCC), OT2 (GGCATCTTGAGTGTCCTACCA and TTCTGAA 
GGAGAACTATGCTACC), OT4 (TGGAAGTAAGGAACAGGCCC and 
TGGCTCCCACTCTAACTCCTT) and OTS (AAGCATCCGTCGCTACCATC 
and AAAGCTTGGCTGGAACCTGT). All procedures were approved by 
the Institutional Animal Care and Use Committee (IACUC) at Baylor 
College of Medicine and conform to the US Public Health Service Policy 
on Human Care and Use of Laboratory Animals. 


In vivo barcode enrichment competition assay 

A piggyBac transposable vector (PBCAG-EGFP-T2A-GWRIR4) was engi- 
neered from the PBCAG-EGFP construct**. The eGFP STOP codon was 
removed and an inframe T2A sequence followed by attR1/attR4 Gateway 
cloning sites were inserted. These attR sites flanked chloramphenicol 
and cccB selection cassettes. A V5 tag sequence was also inserted down- 
stream of the attR4 site. Pik3ca alleles and barcode sequences were 
cloned in using the HiTMMOB approach’. The list of tested PIK3CA 
variants and associated mutational information are given in Supple- 
mentary Table 1. PIK3CA variant constructs were generously provided 
in collaboration with G.B.M., K.L.S. and the CTD? (https://ocg.cancer. 
gov/programs/ctd2). 

Pooled injection cocktails were assembled such that the pool of 
tested variants totalled 1.0 pg per pl rather than a single PIK3CA variant. 
In brief, equal moles of each plasmid were mixed together, ethanol pre- 
cipitated and re-dissolved/concentrated in ddH,O. This pooled plasmid 
mixture was diluted to 1.0 pg per pl in the final IUE injection cocktail. 

After IVE of the pooled cocktail (with 2xCr, PBase and GFPt2aLuc), 
mice were born and observed for symptoms suggestive of brains 
tumours (see below). After demonstration of symptoms, the mouse 
was euthanized and tumours dissected with the aid of the fluorescence 
reporter. Subsequently, gDNA was prepped with the EZNA Tissue DNA 
Kit (Omega Bio-tek, D3396), according to the manufacturer’s instruc- 
tions. Samples were prepared in biological and technical replicates 
(n=3-5 biological replicates; n = 3 technical replicates). In addition, 
the IUE injection cocktail was used for input, prepared in technical 
duplicates. 

After gDNA isolation, barcoded libraries were prepared as pre- 
viously reported!”"®, PCR reactions amplified the barcoded pools 
from 50 ng gDNA (experimental samples) or 2 ng of plasmid pool 
(input control) using Platinum Super Mix (ThermoFisher, 12532016) 
with primers targeting the T3 promoter site (directly upstream 
of the BC) (5’-CAATTAACCCTCACTAAAGG-3’) and the V5 tag 


(5’-ACCGAGGAGAGGGTTAGGGAT-3’). Amplification parameters were 
as follows: 1x (94 °C 4 min); 35x (94 °C 1min, 54 °C 1 min, 68 °C 1 min); 1x 
(68 °C 10 min); 10 °C hold. PCR products were purified with the PureLink 
PCR Purification Kit (ThermoFisher, K310001), processed using the 
lon Plus Library Kit (ThermoFisher, 4471252), subsequently purified 
and ligated to unique Ion Xpress Barcode Adaptors (ThermoFisher, 
4474517). The resulting lon Xpress barcoded libraries were amplified, 
purified, and pooled for PGM sequencing (318 V2 Chip) following the 
manufacturer’s recommendations. Raw data were concatenated into 
one‘reference’ file and indexed using Burrows—Wheeler alignment tool 
for alignment of barcode sequences (with parameters ‘-I7 -t12 -N -n3’) 
for counting the occurrence of each barcode. Barcode enrichment was 
assessed by quantifying the number of occurrences for each barcode 
sequence as a ratio to total number of barcode reads in each sample. 
Standard error of the mean was calculated across replicates and plotted 
as error bars on the barcode enrichment graphs. 


Bioluminescence imaging 

D-Luciferin (150 pg; PerkinElmer, 122799) per gram body weight was 
delivered through intraperitoneal injection. Ten minutes after injec- 
tion, bioluminescence signal was acquired for 5 min, followed by a 
10-s X-ray exposure for skeletal imaging. Bioluminescence scans were 
initiated at P60 and performed every 30 days thereafter. 


Brain tumour collection for histology 

Unless atime point was specified, tumour-bearing mice were observed 
for symptoms suggestive of tumours including—but not limited to— 
lethargy, hunched posture, decreased appetite, decreased grooming, 
trembling/shaking, squinting eyes, partial limb paralysis and abnormal 
gait, denoting the IACUC permitted endpoint. After demonstration of 
symptoms, mice were humanely euthanized and fixed through intra- 
cardial perfusion of 4% paraformaldehyde in PBS. After perfusion and 
dissection, the brain was further fixed overnight in fresh 4% paraform- 
aldehyde solution overnight and then preserved in 70% ethanol for 
eventual paraffin embedding. For frozen preservation, after perfusion 
and dissection, brains were sunk in 20% sucrose in PBS overnight and 
frozen into Tissue-Tek OCT Compound (Sakura Finetek, 4583) and 
stored at -80 °C until cryostat sectioning. 


Histological analysis 

For haematoxylin and eosin (H&E) staining, 10-jm paraffin-embedded 
sections were process as follows: 3 x 3 minin xylene, 3 x 3 minin100% 
ethanol, 3 x 3 min in 95% ethanol, 3 min in 80% ethanol, 5 min in 70% 
ethanol, 5 minin ddH,0, 2.5 minin Harris haematoxylin (Poly Scientific 
R&D, S212A), running tap water wash, 30 s in 95% ethanol, 2.5 min in 
eosin (Poly Scientific R&D, S176), 2 x 2 min in 95% ethanol, 2 x 2 min 
in 100% ethanol, and 2 x 2 minin xylene. Staining was preserved with 
Permount Mounting Media (Electron Microscope Sciences, 17986-01) 
under a coverslip. Histological diagnoses of mouse-IUE-generated 
tumours were validated across n > 6 tumours per variant. 

For immunohistology, frozen brains were sectioned to 20-40 pm 
thickness. Sections were subject to antigen retrieval (when needed), 
blocking and primary antibody incubation overnight at 4 °C. The fol- 
lowing primary antibodies were used: rat anti-BrdU (BUI1/75 (ICRI), 
1:200; abcam, ab6326), mouse anti-gephyrin (1:500; Synaptic Sys- 
tems, 147011), rabbit anti-GFP (1:1,000; ThermoFisher, A-11122), goat 
anti-GPC3 (W-18, 1:100; Santa Cruz Biotechnology, SC-10455), rabbit 
anti-human HLA A (EP1395Y, 1:100, ABCAM, ab52922), mouse anti- 
PSD95 (7E3-1B8, 1:500; ThermoFisher, MA1-046), guinea-pig anti-VGAT 
(1:500; Synaptic Systems, 131004), guinea-pig anti-VGLUTI (1:2,000; 
Millipore, AB5905). We used species-specific secondary antibodies 
tagged with Alexa Fluor 488, 568, or 647 (1:1,000, ThermoFisher) for 
immunofluorescence. After Hoechst nuclear counter staining (Ther- 
moFisher, H3570, 1:50,000), coverslips were mounted with VECTASH- 
IELD antifade mounting medium (Vector Laboratories, H-1000). DAB 


horseradish peroxidase (HRP) (Vector Laboratories, SK-4100) was used 
for chemical colorimetric detection following species-specific, HRP- 
conjugated secondary antibody labelling. 

To assay in vivo cell proliferation, 4 h before collecting, 100 pg 
BrdU (in PBS) per gram body mass was delivered through intraperito- 
neal injection. Mouse brains were collected, frozen and sectioned as 
described above. Before blocking, sections were incubated in 2 N HCI 
at 37 °C for 30 minand neutralized with 3.8% sodium borate for 10 min 
at room temperature. 

Excitatory and inhibitory synapses were quantified with the Synapse 
Counter plugin for Image]*°. The images were taken with the Leica SP8 
confocal microscope at 0.31-ym intervals over a5-ym depth (15 optical 
sections) across replicates (n = 5 technical replicates; n = 3 biological 
replicates). 

The human tumour tissue arrays (US Biomax, T175a) contained a 
spectrum of brain tumours that included high-grade gliomas, menin- 
giomas and T cell lympohoma. Non-disease brain sections were also 
present in the array. Sections were rehydrated similar to paraffin- 
embedded sections above, stained and dehydrated-preserved similar 
to our H&E staining protocol. 


RPPA 

GFP-positive tumour samples were homogenized with lysis buffer 
(1% Triton X-100, 50 mM HEPES, pH 7.4, 150 mM NaCl, 1.5 mM MgCl, 
1mMEGTA, 100 mM NaF, 10 mM Na pyrophosphate, 1mM Na,VO,,10% 
glycerol, protease inhibitor (Roche, 05056489001), and phosphatase 
inhibitors (Roche, 04906837001) onice. Cellular debris was removed by 
centrifugation (4 °C, 14,000 r.p.m., 10 min). The lysates were mixed with 
4x SDS buffer and diluted to a final protein concentration of 1-1.5 pg 
pl*. Samples were prepared in biological duplicates (n = 2). Samples 
were probed with 287 antibodies through the Functional Proteomics 
RPPA Core Facility at MD Anderson Cancer Center. The full list of tested 
antibodies, source and working dilution are available at: https://www. 
mdanderson.org/research/research-resources/core-facilities/func- 
tional-proteomics-rppa-core/antibody-information-and-protocols. 
htm? 


RNA-seq analysis 

Similar to descriptions above, tissue samples were collected with the 
aid of fluorescence microscopy from end-stage tumours. Total RNA 
was isolated using the RNeasy Plus Mini Kit (Qiagen, 74134) according 
to the manufacturer’s protocol. Samples were prepared in biological 
replicates, n >3 per variant genotype. RNA integrity (RNA integrity 
number = 8.0) was confirmed using the High Sensitivity RNA Analysis 
kit (AATI, DNF-472-0500) on a12-Capillary Fragment Analyzer. Illumina 
sequencing libraries with 6-bp single indices were constructed from1 
Lig total RNA using the TruSeq Stranded mRNALT kit (Illumina, RS-122- 
2101). The resulting library was validated using the Standard Sensitivity 
NGS Fragment Analysis Kit (AATI, DNF-473-0500) on a12-Capillary Frag- 
ment Analyzer. Equal concentrations (2 nM) of libraries were pooled 
and subjected to sequencing of approximately 20 million reads per 
sample using the Mid Output v2 kit (Illumina, FC-404-2001) ona Illu- 
mina NextSeq550 following the manufacturer’s instructions. 


Bioinformatics 

Sequencing alignment and transcript abundance estimation (frag- 
ments per kilobase of transcript per million mapped reads, or FPKM) 
was performed using HISAT and Cufflinks. Combat software” was used 
to alleviate any potential batch effects due to substantial time between 
sequencing runs. In the downstream RNA-seq analysis, we compared 
each experimental (variant) group with the wild-type group and with 
the Cherry control group (two separate comparisons) by contrasts 
using linear models on log-transformed (base 2) expression values, 
in which all samples are used to estimate the variance of each gene 
when using the full model (linear model) in design, providing greater 


statistical power. For each experimental group, genes significant versus 
both control and wild-type groups were defined, using P< 0.01 and fold 
change in the same direction for both comparisons. Expression heat 
maps were generated using JavaTreeView*’. 


MRI 

Mice were imaged at the Small Animal MR Imaging Core at Baylor Col- 
lege of Medicine using a 9.4 T horizontal bore magnet with a60-mm 
inner diameter microgradient and a 35-mm inner diameter radiofre- 
quency volume resonator (Bruker BioSpin MRI Gmbh) once a week 
starting at 4 weeks of age. Once anaesthetized (induction at 3-4% ios- 
flurane, maintenance at 2-3% in 100% oxygen at 31 min”), mice were 
placed onan animal bed, head first, in the prone position and inserted 
into the bore of the magnet. Normal body temperature was maintained 
during the imaging sessions. The respiration rate of the mice (typical 
range, 30-40 breaths min”) was monitored throughout the entire 
experiment with an abdominal pneumatic pillow (SA Instruments). 

For in vivo brain imaging of each mouse, three low-resolution scans 
(axial, sagittal and coronal slice orientations) and one high-resolution 
scan (axial orientation) were acquired in succession ina single imaging 
session. T2-weighted images were obtained using a fast spin echo pulse 
sequence with excitation and refocusing flip angles of 90° and 180° 
respectively at a rare factor of 8. In each low-resolution brain scan (rep- 
etition time/echo time = 2,500/33 ms), 10-12 slices per brain volume 
were scanned in a field-of-view of 4.0 x 4.0 cm’, with a matrix size of 
256 x 256 pixels yielding a spatial resolution of 0.156 x 0.156 mm?’ witha 
slice thickness of 1.0 mmand an interslice distance of 1.5 mm, taking 2 
averages per slice with a scan time per brain volume of 2 min40s. Each 
high-resolution brain scan (repetition time/echo time = 2,500/36 ms) 
was taken with 14-18 slices per brain volume, scanned ina field-of-view 
of 3.5 x 3.5 cm’, with a matrix size of 384 x 384 pixels yielding a spatial 
resolution of 0.091 x 0.091 mm’ with a slice thickness of 0.75 mm and 
aninterslice distance of 1.0 mm, taking 10 averages per slice witha scan 
time per brain volume of 8 min. 

Relative tumour area was calculated as the product of the height and 
width of the tumour along the dorsal-ventral and medial-lateral axis, 
respectively. The plane (along the rostral-caudal axis) that contained 
the largest tumour section at the last scan was used for analysis. 


In vivo video EEG recording 

Similar to previous descriptions», IUE mice at 3 weeks of age were anaes- 
thetized by isoflurane vaporization pump (induction at 3-4%, main- 
tenance at 2-3% in 100% oxygen at 11 min”) and surgically implanted 
with a bilateral silver wire electrode (0.005-inch diameter) attached 
toa microminiature connector. Electrodes were placed in the left and 
right, frontal and parietal regions. EEG and behavioural activity in freely 
moving mice were analysed using simultaneous video-EEG monitoring 
(Haramonie software version 6.1c, Stellate systems). All EEG signals 
were filtered using a 60-Hz notch filter, 0.3-Hz high-pass filter and 
70-Hz low-pass filter. Mouse EEG monitoring was initiated at 30 days of 
age, and continued at 5 or 10 day intervals with a 24-h recording period 
per session. Seizure activity was quantified by visual inspection of the 
EEG waveform and corresponding video-recorded behaviour. The fre- 
quency of interictal spike activity was calculated from EEG waveform 
data of each 24-h recording period using Matlab scripts. At least n=4 
mice were analysed for each condition. 


Astrocyte-neuron co-culture and immunocytochemistry 

Cortical astrocyte cultures were prepared from P1-P3 newborn wild- 
type mice. The cortex was dissected, removing the meninges, and 
coarsely chopped with surgical scissors, followed by enzymatic dissoci- 
ation of papain supplemented with DNase I (Worthington Biochemical, 
LKO03150) at 37 °C for 15 min. Enzymatic dissociation was neutral- 
ized with 10% fetal bovine serum (FBS; ThermoFisher, 16000044) in 
DMEM/F12 (ThermoFisher, 11320033). After two centrifugation and 
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PBS washes, cells were suspended in astrocyte culture medium (DMEM/ 
F12, 10% FBS, 1% penicillin/streptomcyin; ThermoFisher, 15140122), 
filtered through a cell strainer, and plated onto a poly-D-lysine-coated 
T75 flask. Once confluent, flasks were mechanically agitated to enrich 
for astrocytes. 

Astrocytes were virally infected selected through antibiotics. After 
selection, approximately 8 x 10* astrocytes were seeded on poly-D- 
lysine-coated 12 mm coverslips in a 24-well culture dish. Around 24 h 
later, approximately 6 x 10* cortical neurons were seeded on the top 
of the astrocytes and maintained in neuronal culture medium (Neu- 
robasal medium (ThermoFisher, 21103049) supplemented with B27 
(ThermoFisher, 17504044), gentamicin (ThermoFisher, 15750060) 
and GlutaMAX (ThermoFisher, 35050061)). 

For immunocytochemistry, similar to previous descriptions”, co- 
cultures were washed and fixed with 4% paraformaldehyde for 15 min, 
then rinsed with PBS, blocked and stained with primary antibodies 
overnight at 4 °C. The following antibodies were used: chicken anti- 
MAP2 (1:1,000; EnCor Biotech, CPCA-MAP2), rabbit anti-synapsin-I 
(1:1,000; Millipore, AB1543) and mouse anti-PSD95 (K28/43, 1:1,000; 
UC Davis/NIH NeuroMab Facility, 75-028). Similar to the above immu- 
nohistochemistry, species-specific Alexa Fluor 488 or 568 were used 
for secondary antibodies, followed by nuclear counter-staining and 
coverslip mounting. 


Electrophysiology 

Whole-cell recordings were performed on neurons co-cultured with 
astrocytes (after viral infection and selection) in parallel at days 
in vitro (DIV) 7 and 14, similar to a previous description. Neurons were 
recorded ina chamber with continuous flow of external solution at a 
fixed flow rate controlled by a flow valve. For each recording, data were 
filtered with a 60-Hz notch filter and analysed with template-based 
event detection algorithms in Clampfit 10.6 (Molecular Devices). 

Standard artificial cerebral spinal fluid (140 mM NaCl, 2.4 mM 
KCI, 10 mM HEPES, 10 mM glucose, 4 mM MgCl,, 2 mM CaCl; pH 
7.3; osmolarity approximately 300 mOsm) was used as the exter- 
nal solution. The internal solution contained 136 mM KCI, 17.8 mM 
HEPES, 1 mM EGTA, 0.6 mM MgCl, 4 mM ATP, 0.3 mM GTP, 12 mM 
creatine phosphate, and 50 U per ml phosphocreatine kinase. The 
recording pipettes used in the experiment had a resistance of 3-6 MQ. 
Antagonists (3 mM kynurenic acid and 20 uM bicuculline) were 
applied sequentially with 3 min of washing in between. All chemicals 
were purchased from Sigma-Aldrich. 

To determine the glutamatergic or GABAergic identities of the spon- 
taneous activities recorded, the spontaneous synaptic potential was 
first recorded for 2 min for baseline activities, as well as activities under 
the treatment of 3 mM kynurenic acid and 20 pM bicuculline (2 min for 
each treatment with 3 min of no treatment in between application of 
the two antagonists). An event witha rise time of approximately 0.5 ms 
and decay time of less than 20 ms were considered to bea glutamatergic 
event, whereas an event with a rise time of around 0.5 ms and a decay 
time of more than 20 ms were considered to be a GABAergic response. 

For the GPC3 condition medium study, as previously described”’, 
COS-7 cells (ATCC, CRL-1651) cultured in neuronal culture medium 
(see above) were transfected with PBCAG-GPC3 (PBCAG-GFP as con- 
trol) using Lipofectamine 2000 Transfection Reagent (ThermoFisher, 
11668019) according to manufacturer’s instructions. Conditioned 
medium was collected and concentrated with Spin-X UF concentra- 
tor columns (Corning, CLS431482). After quantification of protein 
concentration using a standard Bradford assay, protein was diluted 
into neuronal culture medium to 80 pg per ml, and this medium was 
added onto cortical neuron cultures at DIV1. Whole-cell recordings and 
immunocytochemistry were performed at DIV 9. Whole-cell record- 
ings were done with any antagonists to measure total spontaneous 
postsynaptic potential. Neurons co-cultured with wild-type primary 
astrocytes were used as a positive control. 


PDX model 

Patient-derived primary glioblastoma cell lines were maintained in 
neurosphere medium (DMEM/F12, ThermoFisher, 11320082), sup- 
plemented with B27 (1X, ThermoFisher, 17504001), bFGF (20 ng mI’, 
Peprotech, 100-18B) and EGF (20 ng mI”, Peprotech, 100-47)). Cells 
were lentivirally infected with different PIK3CA mutants (or amCherry 
control reporter). After 48 h of infection, cells were selected for by 
puromycin (1pg mI“, ThermoFisher, A1113803). Approximately 50,000 
live cells were stereotaxically transplanted into the brains of 6-week-old 
male ICR-SCID (Taconic, ICRSC-M) mouse brains (from bregma, cells 
were injected 1.0 mm lateral, 2.0 mm caudal, 2.5 mm ventral). Similar 
to previously described approaches”, a week after transplant, EEG 
recording electrodes were implanted and recorded from once a week 
for the subsequent four weeks. 


Statistical analyses 

The log-rank test was used to compare survival differences across 
groups for Kaplan-Meier survival analysis, and values are listed in 
Extended Data Table 1. One-way ANOVA was used to compare BrdU 
proliferation (across technical replicates), MRI tumour growth, EEG 
spike quantification and synapse quantification differences between 
group means, followed by Tukey’s test to compare individual means. 
Independent t-test was used to compare differences across groups for 
EPSC and IPSC frequency and amplitude quantifications. Significant dif- 
ferences are denoted by asterisks in associated graphs, and an absence 
of asterisks denotes no significant difference unless otherwise stated. 
Distribution of data was assumed to be normal, but this was not formally 
tested. Randomization of animal studies was used in the data analysis. 
No data points were excluded from analyses. Mice were excluded from 
analyses if they did not demonstrate reporter (GFP) activity after IUE, 
signifying unsuccessful electroporation. No statistical methods were 
used to predetermine sample size, and investigators were not blinded 
to allocation during experiments and outcome assessment. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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Extended Data Fig. 1| See next page for caption. 
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Extended Data Fig. 1| Differential tumour growth across PIK3CA variant 
tumours. a, Whole-brain GFP fluorescence image overlaid ona brightfield 
image representing characteristic tumours at time of death. b, Representative 
bioluminescence intensity images taken at 1-month intervals starting at 

2 months of age, reflective of median survival trend. Scale bar for 
bioluminescence intensity quantifies photon counts over the 5 min of IVIS 
recording. Experiments were independently repeated three times with similar 
results for each variant. c, Longitudinal T2 MRI of variant tumours. Readings 
were taken at 1-week intervals from 4 weeks of age. All images are spatially 
matched along the rostral-caudal axis. Red dotted outline at P42 denotes the 
tumour boundary. Yellow scale bar, 2.5 mm.n=4 mice for each variant. These 
experiments were not independently repeated. d, BrdU (red) antibody staining 
on 30-day-old mouse brains. White scale bar, 40 um. n=4 mice for each variant. 
n=18(R88Q), 19 (C420R) and 16 (H1047R) technical repeats. e, Associated 
quantification for relative tumour area from MRI and BrdU incorporation 


analysis. Box plots are as in Fig. 3b. *P< 0.05, ***P< 0.001, one-way ANOVA. 

f, Left, immunofluorescence analysis of BrdU (red) incorporation on PDX 
tumour sections. Human tissue was identified by staining for human HLA 
(green). Blue scale bar, 100 pm. Right, quantification of BrdU incorporationina 
2,000 um? area. n=4 mice; n=5 technical repeats. Box plots areas in Fig. 3b. 
***P< 0.001, one-way ANOVA. g, Schematic illustrating experimental approach 
and timeline. h, Representative image of control GFPIUE. Proliferative index 
was calculated by dividing the number of BrdU’ cells on the electroporated side 
(marked by GFP) by the number of BrdU‘ cells on the non-electroporated, 
contralateral side. Purple scale bar, 100 pm. i, Quantification of proliferative 
index for activating variants C420R and H1047R along with control (GFP), 
demonstrating no significant difference in proliferation. n=3 mice;n=5 
technical repeats. Box plots are as in Fig. 3b. Pvalues were not significant 
(P>0.05, one-way ANOVA). 
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Extended Data Fig. 2 | In vivo competition assay identified PIK3CA driver tumours co-electroporated with the H1047Rallele (tagged with barcode 
variants. a, Results of next-generation barcode sequencing from 2xCr tumour sequence 27), demonstrating single amplification when diluted with50 
tissue co-electroporated with all tested alleles pooled together. Pool includes different passenger barcodes (uniquely barcoded Cherry constructs).n=2 
listed variants along with wild-type (WT), 4 silent mutants (E542E, E545E, tumours with 2-3 replicates; red bars denote tumour samples; grey bars denote 
T1025T and H1047H) and 18 uniquely barcoded Cherry constructs as controls. input. Data are mean and s.e.m. 


n=3 tumours. Data are mean ands.e.m. b, Barcode sequencing for 2xCr 
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Extended Data Fig. 3 | PIK3CA variants did not alter tumour histopathology. anaplastic astrocytomaor grade IV glioblastoma. Black scale bars, 1mm; yellow 
H&E staining of brains containing hypercellular and infiltrative high-grade scale bars, 100 ppm; green scale bars, 50 pm. Representative images of each 
gliomas with pleomorphic tumour cells. All tumours are histologically graded variant-driven tumour are fromn=6 brains. 

as high-grade glioma, either WHO (World Health Organization) grade III 
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Extended Data Fig. 4 | PIK3CA variant differentially promote both tumour- seizures confirmed with simultaneous videos. Traces plot from top to bottom 
associated and unassociated seizures. Longitudinal EEG recordings from are recordings from the left frontal, left parietal, right frontal and right parietal 
PIK3CA variant tumour brain (2xCr) and non-tumour brains, starting at P30. brain regions. Traces are representative of four mice per variant. Vertical scale 


Red boxes outline interictal spike activity. Red asterisks denote generalized bars, 300 pV; horizontal scale bars, 0.5s. 
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Extended Data Fig. 5 | PIK3CA variants differentially alter the local synaptic 200 pm of the tumour margin (dotted line), as marked by GFP (pseudo- 
constituency at the peritumoral margins. a,b, |Immunofluorescenceanalysis | colouredinblue). Higher magnification images from the red and blue boxes are 
of tumour brains, stained for excitatory (a) and inhibitory (b) synapses by the displayed. White scale bars, 200 pm; blue scale bars, 50 pm; yellow scale 
colocalization of pre-and postsynaptic markers. Analyses were focused within bars, 12.5 um. Experiments were independently repeated 15 times. 
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Extended Data Fig. 6 | Expression of GPC family members across mouse and 
human glioblastoma models. a, RNA-seq analysis for astrocyte-secreted 
factors that promote synaptogenesis. Each column represents an average of 
biological replicates. n=2 (wild-type), n=4 (C420R), and n=3 (H1047R) mice. 
Pvalues were determined by two-sided ¢-test on log-transformed expression 
values. b, RNA-seq data from the TCGA comparing the same set of genes across 
glioblastoma (GBM) and low-grade glioma (LGG). Asterisks denote significant 


difference between glioblastoma and low-grade glioma, determined by two- 
sided t-test on log-transformed expression values. c, Immunofluorescence 
analyses of P30 variant tumour brains stained for GPC3 (red) and GPF (green), 
demonstrating GPC3 staining at the tumour core (left) and tumour margin 
(right). n=4 mice for each variant. White scale bars, 100 tm. Experiments were 
independently repeated four times. 
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Extended Data Fig. 7| Context-specific requirement for GPC3 in glioma from 2xCr; C420R; GPC3“ tumour tissue. Representative DNA gel 
tumorigenesis. a, H&E staining of 2xCr; C420R; GPC3“ (top) and 3xCr; GPC3 electrophoresis of PCR products after SURVEYOR enzyme treatment. After the 
(bottom) tumour brains histologically graded as high-grade glioma (either ladder (1-kb Plus, ThermoFisher, 10787018), which ranges from 100 to1,500 bp, 
WHO grade lll anaplastic astrocytoma or grade IV glioblastoma). Black scale lanes from left to right contain on-target (ON) and top five off-target (OT1-OTS) 
bars, 1mm; yellow scale bars, 100 pm; green scale bars, 50 um. Representative sites with and without SURVEYOR nuclease (SN) treatment. OT3 rests in an AT- 
images of each variant driven tumour are fromn=6 brains. b, Survival statistics — richregionand was not amplifiable. This experiment was independently 

for in vivo modelling of GPC3 loss and gain in various tumour models. Pvalues repeated three times with similar results. 


were determined by log-rank test. c, SURVEYOR assay analysis of genomic DNA 
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Extended Data Fig. 8 |GPC3 promotes gliomagenesis and synaptic 
imbalance. a, Immunofluorescence staining of 3xCr control and Gpc3 
overexpression tumours for BrdU, PSD95, VGLUT1and VGAT. White scale 

bar, 50 um; blue scale bar, 5 um. b, Quantification of BrdU immunofluorescence 
analysis (left) and excitatory (middle) and inhibitory (right) synapses. n=4 
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(BrdU) and n=3 (synapse) mice for each condition; n =18 (BrdU-3xCr), n=16 
(BrdU-GPC3) and n=15 (synapse) technical repeats for each condition and 
synapse type. Field for BrdU incorporation = 1,600 pm’; field for synapse 
analysis = 34,000 um. Box plots areas in Fig. 3b. *P< 0.05, **P< 0.01, 

***P< 0.001, one-way ANOVA. 
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Extended Data Fig. 9 | See next page for caption. 
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Extended Data Fig. 9 |GPC3 promotes synaptogenesis. 

a, Immunofluorescence staining of co-cultures of astrocytes and neurons for 
PSD95, synapsin1 and MAP2 (Fig. 3b), with astrocytes overexpressing GPC3 via 
virus. Non-infected astrocytes were used as acontrol. White scale bar, 50 pm. 
b, Quantification of PSD95 (top) and synapsin1 (bottom) staining. n=32 
technical repeats for each condition. Data are mean and s.e.m. ***P< 0.001, 
one-tailed independent t-test; Tukey’s test was used to compare individual 
mean values. c, Representative traces of from whole-cell recording of neurons 
co-cultured onastrocytes virally overexpressing GFP (control) or GPC3, with 
associated scale bar. d, Quantification of EPSC and IPSC amplitude and 
frequency from co-cultures. n = 54 (GPC3, EPSC); n=48 (control, EPSC);n=60 
(GPC3, IPSC); n=46 (control, IPSC). Data are mean and s.e.m. ***P< 0.001, 


one-tailed independent t-test; Tukey’s test was used to compare individual 
mean values. e, Immunofluorescence staining of neuron cultures for PSD95, 
synapsin1land MAP2. Wild-type astrocyte-neuronco-cultures served asa 
positive control. Cortical neuron cultures were grown in GPC3 condition 
medium (CM) or GFP control conditioned medium. White scale bar, 50 pm. 

f, Quantification of PSD95 and synapsin staining. n =12 technical replicates 
for each condition. Data are mean ands.e.m. ***P< 0.001, one-tailed 
independent t-test; Tukey’s test was used to compare individual mean values. 
g, Representative traces of spontaneous postsynaptic current (SPSC) analysis 
of neurons co-cultured with astrocytes, GPC3 conditioned medium or GFP 
control medium, with associated scale bar. 


Extended Data Table 1| PIK3CA variant modelling survival statistics 


Survival Range (days' p-values (log rank test 

Sere i Last 2xCr; 2xCr; 2xCr; 2xCr; 2xCr; 2xCr; 2xCr; 2xCr; 2xCr; 
Genotype otal 3 2 death _Median|H1047R__M1043l__H701P__V638A__E545K_—>C420R_—saR88Q PIK3CA__Cher 
3xCr 105 84 0.000129 0.137233 7.49E-11 1.73E-10 0.000733 1.61E-15 0.000605 6.65E-13 2.28E-13 
2xCr; Cherry 351 203 4.51E-13 1.54E-12 0.000378 0.606653 2.56E-13 9.73E-15 6.69E-11 0.000469 
2xCr; PIK3CA 252 150 3.62E-13 6.34E-10 4.41E-09 0.00151 3.10E-13 1.02E-15 1.61E-06 
2xCr; R88Q 155 103 2.54E-05 0.067048 1.43E-10 2.36E-08 9.78E-05 2.71E-08 
2xCr; C420R 84 44 0.008385 8.79E-08 1.30E-11 1.60E-11 0.001131 
2xCr; E545K 119-63 0.398171 0.001046 9.81E-11 2.49E-10 
2xCr; V638A >365* 203 3.90E-10 4.38E-09 0.000867 
2xCr; H701P. >365*__ 273 1.79E-10_1.17E-26 
2xCr; M1043 134 80 0.000236 
2xCr; H1047R 11559 


Statistics for survival study of PIK3CA variants in 2xCr background. 3xCr model served as a positive control. All studies were terminated at 1 year of age. P values were determined by log-rank 


test. 
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in reporting. For further information on Nature Research policies, see Authors & Referees and the Editorial Policy Checklist. 


Statistics 


For all statistical analyses, confirm that the following items are present in the figure legend, table legend, main text, or Methods section. 


n/a | Confirmed 


The exact sample size (n) for each experimental group/condition, given as a discrete number and unit of measurement 


A statement on whether measurements were taken from distinct samples or whether the same sample was measured repeatedly 


4 The statistical test(s) used AND whether they are one- or two-sided 
Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


A description of all covariates tested 


A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 


A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 


— For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
Give P values as exact values whenever suitable. 


For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 


[| For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 


Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 


Policy information about availability of computer code 


Data collection Sequencing alignment and transcript abundance estimations were performed using HISAT (version 2.1.0) and Cufflinks (version 2.2.1.2). 
Combat software (PMID: 16632515) was used to alleviate any potential batch effect due to significant time between sequencing runs. 
Data analysis Expression heat maps were generated using JavaTreeView (version 1.1.6r4) (PMID: 15180930). 


For manuscripts utilizing custom algorithms or software that are central to the research but not yet described in published literature, software must be made available to editors/reviewers. 
We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 


Data 


Policy information about availability of data 


All manuscripts must include a data availability statement. This statement should provide the following information, where applicable: 
- Accession codes, unique identifiers, or web links for publicly available datasets 
- A list of figures that have associated raw data 
- Adescription of any restrictions on data availability 


All data used in this study are available through the NCBI Gene Expression Omnibus (GEO) Repository with a series accession number GSE123519. Sample 
accession number range from GSM3506046 to GSM3506071, 26 samples total. Figure 2, extended data figures 5, supplemental figure 4, and supplemental table 3 
present associated data. 
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No calculations were used to determine sample size for any of our experiments. In general, we aimed for a minimal sample size of 3 for 
significance. However, each experiment may have had its own minimal sample size for the varying reasons (abundant substrate, pragmatic 
rationale, based on other published work). Our reasons for samples size for each experiment are as follows: for our barcode sequencing 
screen, we initially analyzed 5 animals in technical triplicates. Across biological replicates, this generated strong reproductions thus we did not 
extend our analyses into more tissue. For our survival studies, we aimed for approximately 20-25 animals as each animal would then 
represent 4-5 percentage points of resolution. If more animals were generated for a particular genotype, they were included. Histological 
analysis was done on at least 6 tumors per variant. Sequencing of tumor tissues was done on at least 3 animals with the exception of our 
nontumor control which we only did in duplicate. Given how well each variant aligned within its own cohort compared to other variants, we 
felt confident more samples were not required. Both MRI and immunohistofluorescence (IHF) analysis were performed on 4 animals per 
condition. In vitro co-culture and electrophysiology were across biological triplicates. For these aforementioned experiments (MRI, IHF, 
coculture), given the minimal N of 3 and the statistically significant trends we observed, we did not see a need for increased sample size. EEG 
analysis was one 4 animals per genotype at 5 or 10 day intervals. Other studies, even with smaller variations to test, do not demonstrate the 
rigor in temporal profiling that we executed. Given this and the statistically significant trends we observed, we did not see the need to 
increase our sample size. 


The only samples that were excluded from this study were mouse that did not have any GFP reporter activity. Our method utilizes a GFP 
report as a control for proper technique. The absence of GFP would be indicative of poor electroporation technique. Only these samples were 
excluded. 


Mouse samples were obtained across multiple litters. Within a single genotype, electroporation were done in at least 2 pregnant mothers on 
different dates. These cohorts were analyzed separately and compared. Only when these two cohorts were not significantly different, did we 
merge them together. Through the course of our study, replicate cohorts were never significantly different (p>.05). All attempts at replication 
were successful. 


While we reason randomization was not necessary, we still exercise measures for randomization. For RNA-Sequencing, samples were taken 
from animals that expired across the whole range of their survival. The 3 animals were either among the first, middle, or last third to die. We 
utilized a similar approach in the samples used for our H&E stained histological analysis. 


While we reason that blinding was not necessary, we still exercised measures for blinding. This project also required much collaboration. 
Whenever samples were passed from one investigator to another, sample were only given a de-identified label. The samples were only 
properly labeled after analysis was complete. 


Reporting for specific materials, systems and methods 


We require informatio 


n from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 


system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 

n/a | Involved in the study n/a | Involved in the study 

| Antibodies [| ChiP-seq 
Eukaryotic cell lines Flow cytometry 
Palaeontology MRI-based neuroimaging 
Animals and other organisms 


Human rese 


Clinical data 


Antibodies 


arch participants 


Antibodies used 


Antibodies are listed in the following format: host species anti-antigen (clone, dilution; source, product number; lot number). 
Clone IDs are not reported if one was not assigned. 


For IHC and ICC: 
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rat anti-BrdU (BU1/75 (ICR1), 1:200; ABCAM, #ab6326; GR3269246-1), mouse anti-Gephyrin (3B11, 1:500; Synaptic Systems, 
147011; 1-64), rabbit anti-GFP (1:1000; ThermoFisher, #A-11122; 1753594), goat anti-GPC3 (W-18, 1:100; Santa Cruz 
Biotechnology, #SC-10455; B2912), chicken anti-MAP2 (1:1,000; EnCor Biotech. Inc., #CPCA-MAP2; 7225-9), rabbit anti-synapsin- 
| (1:1,000; Millipore, #AB1543; 2363548), mouse anti-PSD95 (for ICC) (K28/43, 1:1,000; UC Davis/NIH NeuroMab Facility, 
#75-028; 455.7.JK.22), rabbit anti-human HLA A (EP1395Y, 1:100, ABCAM, ab52922; GR258732-22), mouse anti-PSD95 (for IHC) 
(7E3-1B8, 1:500; ThermoFisher, MA1-046; LI147875), guinea pig anti-vGAT (1:500; Synaptic Systems, 131004; 2-41), and guinea 
pig anti-vGlut1 (1:2000; Millipore, ABS905; 3193844) 


For RPPA, lot numbers were not available. These were used through the MD Anderson Functional Proteomics RPPA Core. Details 
stated in methods: rabbit anti-4E-BP1 phospho-S65 (1:250; Cell Signaling Technology, #9456), rabbit anti- Akt phosphor-S473 
(1:150; Cell Signaling Technology, #9271), rabbit anti-Akt phospho-T308 (1:250; Cell Signaling Technology, #2965), rabbit anti- 
GSK3a/B phosphor-S21/S9 (1:200; Cell Signaling Technology, #9331), rabbit anti-mTOR phosphor-S2448 (1:50; Cell Signaling 
Technology, #2971), rabbit anti-p70 S6 Kinase phosphor T389 (1:50; Cell Signaling Technology, #9205), rabbit anti-PRAS40 
phospho-T246 (1:500; ThermoFisher, #441100G), rabbit anti-Rictor phospho-T1135 (1:200; Cell Signaling Technology, #3806), 
rabbit anti-S6 phospho-S235/S236 (1:2500; Cell Signaling Technology, #2211), rabbit anti-S6 phospho-S240/ S244 (1:1000; Cell 
Signaling Technology, #2215); rabbit anti-Tuberin/TSC2 phospho-T1462 (1:38; Cell Signaling Technology, #3617) 
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Validation All immunohistofluorescence and immunocytofluorescence staining were accompanied with secondary antibody only controls 
(not included in manuscript). Presented images are only those where accompanying secondary only controls showed no signal. 
Validation for these antibodies are as follows: rat anti-BrdU (PMID: 16670699), rabbit anti-GFP (currently over 266 published 
manuscripts listed: https://www.thermofisher.com/antibody/product/GFP-Antibody-Polyclonal/A-11122), goat anti-GPC3 (PMID: 
21112773, 16734618, 15475451), chicken anti-MAP2 (PMID: 15642108, 2469170, 6120944, 16832065, 18988710, 22623668, 
16817858, 16487970), rabbit anti-synapsin-I (PMID: 26060345, 26184109, 24633176, 24633867, 24638034, 25403753, 
24719092, 25471559, 24184637, 24945922), mouse anti-PSD95 (currently over 54 published manuscripts listed: https:// 
www.labome.com/review/gene/human/PSD-95-antibody.html), and rabbit anti-human HLA A (currently lists 37 published 
manuscripts listed: https://www.abcam.com/hla-a-antibody-ep1395y-ab52922.html). The mouse anti-Gephyrin, mouse anti- 
PSD95, guinea pig anti-vGAT, and guinea pig anti-vGlut1 used for immunohistological analysis of synapses were all according to 
PMID: 27615741. Antibodies used for RPPA were validated through the Functional Proteomics RPPA Core adhering to the 
following 3 criterion: A single or dominant band on western blotting is required and dynamic range and specificity is determined 
using: 1) peptides, phosphopeptides, growth factors, inhibitors, RNAi, cells with wide levels of expression including 330 cell lines 
under multiple conditions on a single array; 2) Pearson correlation coefficient between RPPA and western blotting of greater 
than 0.7 is required; and 3) reproducible results intra-slide or interslide are critical. 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) COS7 - ATCC, CRL-1651; primary human GBM cell line 

Authentication COS7 cells was not authenicated; primary human GBM cell lines were received from Dr Frederick Lang, MD Anderson, 
Houston, TX 

Mycoplasma contamination Cells were not tested for mycoplasma contamination 


Commonly misidentified lines No commonly misidentified cell lines were used 
(See ICLAC register) 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals Mus musculus, outbred ICR CD-1 IGS, both male and female mice were used in this study. These mice were all mice were less 
than 1 year old. Additionally, SCID-ICR male mice were used for xenograft studies. These mice were 6 weeks old at tumor 
transplantation. Xenografts were left to grow up to 6 weeks post transplants. 


Wild animals No wild animals were used 
Field-collected samples No samples were collected from the field 
Ethics oversight All procedures were approved by the Institutional Animal Care and Use Committee at Baylor College of Medicine and conform to 


the US Public Health Service Policy on Human Care and Use of Laboratory Animals. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics We do not have this information. All human derived samples were received from Dr Frederick Lang, MD Anderson (PMID: 
29016843) 


Recruitment We do not have this information. All human derived samples were received from Dr Frederick Lang, MD Anderson (PMID: 
29016843) 


Ethics oversight Samples were collected through the auspicies of Dr Frederick Lang's IRB protocol (LABO4-001). We only recieved samples post- 
de-idenficiation. We were told these samples were published (PMID: 29016843). We were not given any information towards 
the identify of the source patient. Additionally, we have not conducted any experiments whereby we could acertain such 
information. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Magnetic resonance imaging 


Experimental design 


Design type Weekly longitudinal structural imaging 


Design specifications T2 weighted low resolution and high resolution anatomical scans were acquired one time per week for four weeks 
unless the animal perished prior to the four week time point. 
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Behavioral performance measures N/A 


Acquisition 

maging type(s) Structural Imaging 

Field strength 9.4T 

Sequence & imaging parameters For in vivo brain imaging of each mouse, three low resolution scans (axial, sagittal and coronal slice orientations) and 
one high resolution scan (axial orientation) were acquired in succession in a single imaging session. T2-weighted images 
were obtained using a fast spin echo pulse sequence with excitation and refocusing flip angles of 90 and 180° 
respectively at a rare factor of 8. In each low-resolution brain scan (TR/TE = 2500/33 ms), 10-12 slices per brain volume 
were scanned in a field of view (FOV) of 4.0 x 4.0 cm2, with a matrix size of 256 x 256 pixels yielding a spatial resolution 
of 0.156 x 0.156 mm2 with a slice thickness of 1.0 mm and an interslice distance of 1.5 mm, taking two averages per 
slice with a scan time per brain volume of 2 min 40 s. Each high-resolution brain scan (TR/TE = 2500/36 ms) was taken 
with 14-18 slices per brain volume, scanned in a field of view (FOV) of 3.5 x 3.5 cm2, with a matrix size of 384 x 384 
pixels yielding a spatial resolution of 0.091 x 0.091 mm2 with a slice thickness of 0.75 mm and interslice distance of 1.0 
mm, taking 10 averages per slice with a scan time per brain volume of 8 min. 

Area of acquisition Multiple slices were taken that covered the whole brain 

Diffusion MRI Used Not used 


Preprocessing 


Preprocessing software None 
Normalization None 
Normalization template None 
Noise and artifact removal None 
Volume censoring Visual inspection -- no significant motion artifacts were detected during imaging. 


Statistical modeling & inference 


Model type and settings We performed univariate analysis on the relative tumor area. Relative tumor area was calculated as the product of the 
size of the aberrant growth along the dorsal-ventral and medial-lateral axises. This analysis was performed at a fixed 
location along the rostral-caudal axis. Analysis was done across fixed time points: 4, 5, and 6 weeks of age. Comparisons 
were done across genotypes. The analysis was across 4 mice per cohort unless the mice died prior to the acquisition of 
a datapoint. 


Effect(s) tested No behavior effects were tested. The only differences across cohorts were genetic differences. All the animals were 
treated the same prior to, during, and after the MRI recording session. 


Specify type of analysis: [ ]Wholebrain [x] ROl-based [| Both 


MicroDicom, a free DICOM viewer and software, was used to visualize and analyze MRI images. 
Suspected tumor regions were identified in the last scan, prior to mice being euthanized. These 
anatomical regions were backtracked across time. Suspected tumor regions were based against age 

Anatomical location(s) match control animals which did not have any manipulation done to them, as well as differences in 
symmetry between the two hemispheres of the brain. Relative tumor area was calculated as the product 
of the size of the aberration along the dorsal-ventral and medial-lateral axises. The same anatomical 
sections along the rostral-caudal axis was used for analysis. 


Statistic type for inference N/A 
(See Eklund et al. 2016) 


Correction No correction methods were used 


Models & analysis 


n/a | Involved in the study 


Functional and/or effective connectivity 
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Graph analysis 


Multivariate modeling or predictive analysis 
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Many proteins exist naturally as symmetrical homooligomers or homopolymers’. The 


emergent structural and functional properties of such protein assemblies have 
inspired extensive efforts in biomolecular design’ >. As synthesized by ribosomes, 
proteins are inherently asymmetric. Thus, they must acquire multiple surface patches 
that selectively associate to generate the different symmetry elements needed to 
form higher-order architectures’*—a daunting task for protein design. Here we 
address this problem using an inorganic chemical approach, whereby multiple modes 
of protein-protein interactions and symmetry are simultaneously achieved by 
selective, ‘one-pot’ coordination of soft and hard metal ions. We show that a 
monomeric protein (protomer) appropriately modified with biologically inspired 
hydroxamate groups and zinc-binding motifs assembles through concurrent Fe* and 
Zn” coordination into discrete dodecameric and hexameric cages. Our cages closely 
resemble natural polyhedral protein architectures”* and are, to our knowledge, 


unique among designed systems 


°3 in that they possess tightly packed shells devoid 


of large apertures. At the same time, they can assemble and disassemble in response 
to diverse stimuli, owing to their heterobimetallic construction on minimal 
interprotein-bonding footprints. With stoichiometries ranging from [2 Fe:9 Zn:6 
protomers] to [8 Fe:21 Zn:12 protomers], these protein cages represent some of the 
compositionally most complex protein assemblies—or inorganic coordination 
complexes—obtained by design. 


Cage-like architectures have featured prominently in supramolecular 
design, owing to their aesthetically appealing structures and isolated 
interiors, which enable them to encapsulate molecular cargo and to 
perform selective chemical transformations“. Inspired by naturally 
occurring polyhedral assemblies, protein engineers have combined 
principles of symmetry with the proper design and arrangement of non- 
covalent interfaces to build diverse supramolecular architectures? ”. 
However, some of the key structural features of natural protein cages 
have been difficult to emulate (Fig. 1a). First, each cage is invariably com- 
posed of asymmetric protomers, which possess multiple self-associative 
patches to simultaneously satisfy the symmetry requirements necessary 
to build polyhedral assemblies (that is, concurrent generation of at least 
C,and C, symmetries, in addition to C, or C, symmetries for octahedra 
or icosahedra)’*. Second, these self-associative patches collectively 
occupy alarge fraction of the surface area on each protomer, enabling 
the formation of tightly packed shells with small apertures to enable the 
influx and efflux of select species®. Third, although the inter-protomer 
interfaces in natural protein cages are extensive, to ensure stable and 
selective association, they are also often conformationally flexible and 
chemically tunable, allowing the cages to undergo cooperative motions 
or disassembly in response to external cues”. 

Given the difficulty of designing multiple, selectively associative 
surfaces on a protomer, construction of artificial cages has relied 


exclusively on using natively oligomeric proteins or designed peptides 
with C,.. symmetries as building blocks and the design of asingle type 
of binary protein-protein interaction (PPI) through computation”, 
genetic fusion”, disulfide bond formation” or metal coordination””. 
Although these strategies can yield polyhedral symmetries, the result- 
ing architectures are highly porous, do not display externally control- 
lable assembly or disassembly (with two exceptions)? and cannot be 
easily modified to adopt alternative structures (that is, they are not 
modular or flexible). Inspired by previous work on bimetallic supramo- 
lecular coordination cages”°”, we investigated whether these design 
problems could be addressed using an inorganic chemical approach, 
wherein a protomer is equipped with chemically orthogonal coordina- 
tion motifs to self-assemble into polyhedral architectures. 


Design of bimetallic protein cages 


Previously, we have taken advantage of the simultaneous strength, labil- 
ity and directionality of metal coordination bonds (particularly those 
formed by late first-row, low-valent transition metal ions) to effect the 
self-assembly of discrete protein complexes® and extended one-, two- 
and three-dimensional arrays”. Typically, selective nucleation sites 
for metal-mediated PPIs are formed by pairs of metal-binding amino 
acids (mainly His, Asp and Glu residues) (Fig. 1b) or non-native bidentate 
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Fig. 1| Design of protein cages. a, Representative examples of natural protein 
cages (DNPEP, aspartyl aminopeptidase; HuHF, human heavy-chain ferritin) 
and their assembly from asymmetric protomers. Per-protomer solvent- 
accessible surface areas (SASA) and buried surface areas (BSA) are indicated. 
Associative surfaces on the protomers are coloured red for homologous 
interactions and orange/yellow or blue/cyan for heterologous interactions. 
b-f, Design of artificial protein cages by metal coordination. b, C,-symmetric 


functionalities (for example, 2,2’-bipyridine, 1,10-phenanthroline and 
8-hydroyxquinoline)°”*. However, all of these natural or synthetic 
coordination motifs can be considered as soft (or intermediate-soft) 
according to the Hard-Soft Acid-Base (HSAB) classification” and have 
considerable overlap in terms of their coordination preferences for 
soft, low-valent transition metal ions. Owing to this lack of chemical 
discrimination, it has not been possible to design a heterometallic 
protein complex for which the self-assembly is selectively guided by 
multiple metal ions that mediate different PPIs. 

To achieve this goal, we turned to a bidentate chelating motif, 
hydroxamate (HA, the conjugate base of hydroxamic acid),acommon 
functional group found in bacterial siderophores to enable excep- 
tionally stable coordination of Fe* ions”°””. HA groups preferentially 
form octahedral Fe** complexes with an inherent C, symmetry that 
we sought to impose on protein oligomerization (Fig. 1c). Notably, 
the formation constants of Fe**:(HA), complexes (>10”° M *) are vastly 
higher than those of other metal-HA complexes, such that they can be 
considered as orthogonal to the aforementioned soft metal-ligand 
combinations”°””. For protein derivatization, we synthesized a small 
reagent, iodo-hydroxamic acid (IHA), which selectively reacts with Cys 
residues (Fig. 1d, Extended Data Fig. 1). The resulting Cys—HA side chain 
is isosteric with that of arginine and devoid of bulky aromatic moieties, 
furnishing a pseudo-natural amino acid functionality with the ability 
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protein dimerization induced by tetrahedral Zn” coordination of native amino 
acid side chains. c, C;-symmetric protein trimerization induced by octahedral 
Fe**-tris-hydroxamate coordination. d, Scheme showing modification of native 
Cys side chains with IHA to yield Cys—HA, whichis isosteric with arginine 

(light grey). e, Zn-mediated solution dimerization and crystallization of 
CFMCL1.f, Structural overview of the cytochrome cbs, scaffold. Salient 
structural elements are shownas sticks. 


to chelate hard metal ions and induce C, symmetric oligomerization 
onasingle-residue footprint. 

As amodel system, we used cytochrome cb,,,,a monomeric four- 
helix-bundle protein that has proved to bea versatile building block for 
metal-directed protein self-assembly°. A variant of cyt Cbs¢, (CFMC1), 
which was designed and observed to form Zn-mediated dimers in solu- 
tion, crystallizes into rhombohedral lattices in which the protomers 
arrange into dodecameric, cage-like units via Zn-mediated crystal pack- 
ing interactions”® (Fig. le). Whereas Zn-mediated interactions were 
not sufficiently strong to maintain the tetrahedral dodecamers upon 
crystal dissolution, we envisioned that these lattice units could serve 
as astructural model to engineer the protomers such that they would 
form self-standing cages. Looking first to stabilize the C, symmetric 
interfaces, we incorporated a bidentate His8—Asp12 motif to medi- 
ate the antiparallel association of two protomers along their helices 1 
via tetrahedral Zn”* coordination (Fig. 1f). Given that C, symmetric 
interfaces are small and heterologous (that is, they involve two dif- 
ferent patches on each protomer; Extended Data Fig. 2a), they were 
unsuitable for stabilization by noncovalent interactions. Therefore, 
we focused on the central pores ineach C, symmetric substructure and 
identified positions 63 and 82s suitable locations for installing Cys-HA 
functionalities, which would stabilize trimeric substructures by form- 
ing Fe**:(HA), centres (Fig. 1f). Thus, we prepared two CFMC1 variants 
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Fig. 2| Characterization of BMC2 cages. a, ns-TEM of BMC2 cages obtained by 
the dissolution of three-dimensional crystals. Inset, close-up of the boxed 
region. Scale bar, 50 nm. b, AUC characterization of BMC2 protomers and 
BMC2 cages after crystal dissolution and after subsequent treatment with 
EDTA. c, Crystal structure of the BMC2 cage. Fe and Znions are represented as 
orange/red and blue spheres, respectively. The central cavity is highlighted by 
a yellow sphere. Two types of C; vertices formed by Fe:(Cys63-HA), and 
Fe:(Cys82-HA), coordination motifs form two superimposed tetrahedra to 
generate atriakis tetrahedron. d, Surface representations of the BMC2 cage, 
with metal ions shownas coloured spheres. Insets show atomic details of each 
metal coordination site, with the mF, — DF, electron density omit map (blue 
mesh) contoured at 30. 


designated bimetallic cage 1 and bimetallic cage 2 (BMC1 and BMC2; 
Extended Data Fig. 2b). Both BMC1and BMC2 bear the His8-Asp12 motif 
on helix land Cys63-HA along with the native peripheral Zn coordina- 
tion sites (Alalj.,¢.m, ASP39 and His77) of the parent CFMCI1 structure. 
BMC2 additionally contains Cys82—HA (Extended Data Figs. 1, 2b). 

Crystals of BMC1 and BMC2 were obtained in the presence of near 
equimolar ZnCl, and FeSO,. These crystals were isomorphic (R32 space 
group; a= b=126 +1A,c=167 +1A) with those of CFMC1’s, indicating 
that they possessed the same underlying lattice structure composed 
of dodecameric units (Extended Data Table 1, Extended Data Fig. 2). 
Crystals were dissolved in a solution lacking the precipitating agent 
(PEG-400) and then analysed by negative-stain transmission electron 
microscopy (ns-TEM; Fig. 2a, Extended Data Fig. 3). The images revealed 
uniform particles with a diameter of 8.4 + 0.8 nm in the case of BMC2 
but not BMC1, implying that two HA coordination motifs are neces- 
sary for cage stability. Analysis of the same BMC2 solution by analyti- 
cal ultracentrifugation (AUC) indicated a predominant species witha 
molecular weight (MW,,,) of about 140 kDa (Fig. 2b), approximating 
the calculated value (MW.,,),) of 150 kDa for a dodecamer. BMC2 par- 
ticles dissociated upon treatment with ethylenediamine tetraacetic 
acid (EDTA), confirming their metal-dependent self-assembly (Fig. 2b, 
Extended Data Fig. 3). 

We determined the crystal structure of the BMC2 cage at 1.4 Areso- 
lution (Extended Data Table 1), revealing a compact structure with 
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the shape of a truncated tetrahedron, outer dimensions of 80 x 90A 
and a cavity volume of 32,700 A? (Fig. 2c, Extended Data Fig. 4). Like 
natural protein cages, the shell is tightly packed and the largest open- 
ing measures less than 4 A across. Nearly 30% of the surface area 
of each protomer (1,700 A? out of 6,500 A?) is buried in interfaces 
despite a design footprint of only four amino acids (His8, Asp12, Cys63 
and Cys82). 

The full complement of metal ions, comprising eight Fe ions (four 
eachin the C, symmetric pores) and eighteen Zn ions (six in C, interfaces 
and twelve in peripheral sites) are clearly resolved (Fig. 2c). Anomalous 
X-ray diffraction data collected at and below Fe and Zn K-edges indicate 
that the designed Fe- and Zn-coordination sites exclusively bind to their 
cognate ions with no evidence of crosstalk (Extended Data Fig. 5, Sup- 
plementary Tables 3-6), which establishes that the metal-dependent 
self-assembly of BMC2 cages occurs with absolute chemical selectiv- 
ity. The Fe centres form the eight C, vertices of atriakis tetrahedron, a 
Catalan solid with twelve equivalent faces (Fig. 2c). It can be viewed as 
the superposition of two tetrahedra: four Fe centres that are coordi- 
nated by Cys63-HA motifs generate the larger of these two tetrahedra 
(with an edge length (l.4..) of 62 A), and four Fe centres coordinated by 
Cys82-HA motifs produce the smaller one ((.4. = 44 A) (Fig. 2c). The 
BMClstructure, in comparison, has a regular tetrahedral arrangement 
of four Fe centres as it lacks the Cys82-HA group (Extended Data Table 1, 
Extended Data Fig. 2). 

As designed, the edges of the BMC2 tetrahedra are formed by six Zn 
ions located centrally in C, interfaces (Fig. 2d). The Fe**:(Cys63-HA), 
and Fe**:(Cys82-HA), motifs display near-ideal octahedral geome- 
tries (Fig. 2d), with the former in A (left handed) and the latter in A 
(right handed) configuration (Extended Data Fig. 5). Notably, the Fe™: 
(Cys82-HA), centre also adopts an alternative conformation (20% abun- 
dance) owing to the flexibility of the Cys—HA side chain (Extended Data 
Fig. 5k). All Fe-O bond distances are in the range of 1.95-2.1A, whichare 
typical of Fe**:(HA), complexes”. Given that a Fe* precursor was used 
to initiate self-assembly, and as Fe:HA, centres have low reduction 
potentials (£,.4<-400 mV)”, this observation suggests that the protein 
self-assembly involves the initial formation of Fe’*:HA centres, fol- 
lowed by the thermodynamically favoured oxidation of these species 
into Fe* either by the Fe**-haem centres embedded in each protomer 
(see Methods) or directly by ambient O,. 


Reversible assembly of dodecameric cages 


Despite the stability of isolated BMC2 cages, their formation required 
an initial crystallization step. We reasoned that slow crystal nucleation 
or growth kinetics and the high attendant protein and metal concen- 
trations probably increased the fidelity and yield of the complex self- 
assembly process to produce a discrete supermolecule consisting of 
12 protomers and 26 metal ions of two different kinds. Reasoning that 
strengthened Zn-mediated interactions across the C, interface could 
increase the efficiency of cage self-assembly in solution, we gener- 
ated two second-generation variants based on BMC2: BMC3 and BMC4 
(Extended Data Figs. 1, 2b). In BMC3, the helix 1 surface was engineered 
to form two Zn-coordination sites (composed of HisS, His8, Asp12 and 
His16) across the C, interface, whereas in BMC4 three potential Zn- 
coordination sites were engineered (one central site composed of two 
His8-Asp12 pairs as in BMC3 and two peripheral sites composed of 
Glu2, GluS, His16 and Glu25). In BMC4, we also removed the Cys63—-HA 
group withthe purpose of eliminating any potential undesired assem- 
bly products that involve heteromeric Fe* coordination by Cys63-HA 
and Cys82-HA. 

BMC3 indeed formed dodecameric cages in solution with high yields 
(>80%) as determined by ns-TEM and AUC measurements (Fig. 3a, 
Extended Data Fig. 3). The 1.85 A-resolution crystal structure confirmed 
the eight Fe centres in the vertices and the two Zn coordination sitesin 
each C, interface (twelve in total) as well as nine of the twelve possible 
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Fig. 3 | Characterization of BMC3 cages. a, AUC characterization of BMC3 self- 
assembly. b, Surface representation of the BMC3 cage (as derived from the 
crystal structure), oriented to show the incorporation of two Znions at the C, 
symmetric interface. c, The 2.6 A-resolution cryo-EM electron density map for 
the BMC3 cage. d, Atomic details of both Zn-binding sites of the two-fold 
interface overlaid with the electron density mF, — DF, omit map from the crystal 
structure (left) and as observed for the cryo-EM structure (right). Additional 
side chains and waters are shown for the cryo-EM structure to emphasize the 
structural robustness of the interface. e, Overlay of the BMC3 X-ray and cryo- 
EM structures to highlight the isotropic expansion of the cage in the absence of 
crystallographic packing interactions. 


peripheral Zn sites, which complete a [8 Fe:21 Zn:12 protomers] archi- 
tecture (Fig. 3b). Notably, the self-assembly of BMC3 cages in solution 
was dependent on the presence of both Fe and Zn ions. The absence 
of either metal ion or the addition of various other first-row transition 
metal ions instead of Fe led to smaller oligomeric forms of BMC3 or 
non-specific assemblies (Fig. 3a, Extended Data Fig. 6a). BMC3 cages 
also formed witha Fe* precursor, Fe(acetylacetonate), (Extended Data 
Fig. 6b). Consistent with self-assembly under thermodynamic control, 
the formation of dodecameric cages was independent of the order 
of addition of Fe or Zn ions. We determined the single-particle cryo- 
electron microscopy (cryo-EM) structure of isolated BMC3 cages at a 
resolution of 2.6 A (Fig. 3c, Extended Data Table 2). A major portion of 
the assembly could be resolved at 2.0 A or less (Extended Data Fig. 7). 
At this resolution, nearly all side chains, Zn coordination sites and some 
ordered water molecules are clearly distinguished (Fig. 3d). Consistent 
with the crystallographically observed flexibility of Fe**:(Cys-HA), coor- 
dination sites, electron densities inthe C, vertices are diffuse and some 
side chains that display high temperature factors in the crystal struc- 
ture are found in alternative conformations in the cryo-EM structure 
(Extended Data Fig. 7). These observations confirm that the solution 
architecture of the BMC3 cage closely reflects the solid-state structure. 
Probably owing to lattice packing, the latter is isotropically compressed 
by around 2-3 A compared to the former (Fig. 3e), which can be accom- 
modated by slight changes in interfacial metal coordination. 

Next, we examined the assembly and disassembly behaviour of BMC3 
cages in response to different stimuli. BMC3 cages readily disassemble 
upon treatment with EDTA (Extended Data Fig. 6c). They were sta- 
ble at 50 °C but dissociated upon incubation at 70 °C (Extended Data 
Fig. 6d). A key feature of siderophores is that their cellular release of Fe 
is promoted by the destabilization and labilization of their Fe**:(HA), 
centres through reduction to the Fe” form in the cytosol’. Along these 
lines, the treatment of BMC3 cages with a strong reductant (dithion- 
ite; F,.4< -500 mV)” led to their disappearance and the emergence 
of monomeric species (Extended Data Fig. 6e). By contrast, a weaker 
reductant (ascorbate; F,.4>-100 mV at pH 7)” witha reduction potential 
higher than that of Fe**:(HA), had considerably less effect (Extended 
Data Fig. 6e), suggesting that the disassembly of BMC3 cages occurs 


©  Fe?*(Cys82-HA), Zn?*:(H5/D8), Zn?*:(D21/E25/H77) 


Expected MW: 75 kDa 


Normalized c(M) 


. 80 100 120 
Molecular weight (kDa) 


[Fe,:Zng:BMC4,] 


fs 


BMC3 


BMC4 metal skeleton BMC4 


Fig. 4 | Characterization of BMC4 cages. a, AUC characterization of BMC4 self- 
assembly. b, Crystal structure of the BMC4 cage. Fe and Znions are represented 
as red and blue spheres, respectively. The central cavity is highlighted bya 
yellow sphere. The structural skeleton formed by Fe and Znions is shown below 
the structure. c, Surface representations of the BMC4 cage, with metalions 
shownas coloured spheres. Atomic details of each metal coordination site are 
shown in insets, with the mF, — DF, electron density omit maps (blue mesh) 
contoured at 30. d, Comparison of the C, symmetric protein interfaces in 
different BMC constructs. Residues 8 and 12, which are common toall 
constructs, are coloured purple. The slippage of the two-fold helix interface to 
accommodate the hexameric architecture of BMC4 is indicated with red 
arrows. e, Comparison of the apical angle formed by the Fe:(Cys82-HA);- 
mediated vertices in BMC3 and BMC4 cages. 


through the reduction of the Fe centres. These observations establish 
BMC3 cages as a distinctive system among natural and artificial protein 
architectures the assembly and disassembly of which can be controlled 
through multiple stimuli: chemical, thermal or redox. BMC3 cages can 
passively encapsulate small fluorogenic molecules in either their lumen 
or inter-protomer interfaces, retain them for several days and release 
them upon treatment with EDTA (Extended Data Fig. 8). 


Formation ofa hexameric cage 

Unexpectedly, AUC measurements indicated that the other second- 
generation variant, BMC4, self-assembled as a hexamer upon Fe and 
Zn coordination, with yields exceeding 70% (Fig. 4a). The 1.50 A resolu- 
tion crystal structure of the BMC4 assembly revealed aD, symmetric, 
cage-like architecture with a composition of [2 Fe:9 Zn:6 protomers], 
outer dimensions of 75 A x 50 A and a cavity volume of more than 
7,800 A? (Extended Data Fig. 4). The overall shape is a trigonal bipy- 
ramid (Fig. 4b), which is the smallest polyhedral architecture witha 
sizeable interior cavity that can be constructed from an asymmetric 
building block. The apical vertex of each pyramidal halfis formed bya 
Fe**:(Cys82-HA), motif shared by three protomers (Fig. 4b). These C, 
symmetric vertices are further reinforced by Zn” ions that link pairs 
of protomers through Asp21, Glu25 and His77 coordination (Fig. 4c). 
The pyramids are joined by three equatorial, C, symmetric vertices 
mediated by Zn centres coordinated to GluS and His8 (Fig. 4b, c). Acom- 
parison to the BMC2 and BMC3 cages indicates that this unexpected 
coordination motif requires an approximately 9 Aslip of each protomer 
along the C, symmetric interfaces (Fig. 4d). The shift markedly reduces 
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the C, symmetric contact area between protomers, effectively trans- 
forming the edges in the tetrahedral BMC2 and BMC3 cages to vertices 
inthe trigonal bipyramidal BMC4 cages. BMC4 cages exhibited similar 
thermal stability to BMC3 cages, with both species disassembling at 
below 70 °C. The thermal robustness of the BMC3 and BMC4 cages 
appear to be limited, at least in part, by the relative instability of the 
individual protomers (Extended Data Fig. 6d). 

The large structural transformation is accompanied by a reduction 
in the apical angle formed at the Fe**:(Cys82-HA),-mediated vertices 
from 101° in the BMC2 and BMC3 cages to 81° in the BMC4 cage (Fig. 4e). 
This observation highlights the conformational adaptability of the 
Fe**:(Cys82-HA), coordination motif, enabling it to accommodate 
different polyhedral geometries. Such behaviour is reminiscent of 
the interfacial flexibility in some icosahedral virus capsids in which 
the same protomer can form hexamers on capsid faces and pentam- 
ers on capsid vertices’. It is worth noting that BMC4 contains all of the 
Zn-coordinating residues on helix 1 to form the C, symmetric inter- 
faces observed in the dodecameric BMC3 cage, indicating that the 
self-assembly process selects an alternative interfacial arrangement 
of lower free energy, enabled by the reversibility of metal coordination 
interactions. Interms of protein design, a caveat of interfacial flexibility 
is that it may lead to nonspecific or unintended self-assembly products, 
although it can also allow error correction during self-assembly and 
increase tolerance to design imperfections. 


Conclusions 


The self-assembly and function of biomolecular systems are predicated 
upon their specificity, stability and adaptiveness, which, in turn, are 
enabled by extensive networks of non-covalent interactions. Here, 
we have shown that fundamental concepts in inorganic coordination 
chemistry can be applied to achieve all of these attributes in protein 
self-assembly and, specifically, to construct complex polyhedral pro- 
tein architectures from a simple, asymmetric building block. Despite 
their minimal design footprints, these cage-like architectures are 
distinguished by their structural compactness and responsiveness— 
hallmarks of evolved systems suchas viral capsids. Key to our construc- 
tion strategy was the reimagination of a biological coordination motif, 
hydroxamic acid, within a new structural context: as anew amino acid 
side chain with the ability to chelate hard metal ions. This example 
expands the growing lexicon of post-translational modifications that 
broaden the chemical scope of proteins. 
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In follow-up experiments to this Letter, we have been unable to replicate 
key aspects of the original results. Most importantly, the findings from 
behaviour studies and sequencing of microglia isolated from 564lgi 
autoimmune mice as shown in Figs. 1a, b, d and 3a, b are not substanti- 
ated upon further analysis of the original data. The authors therefore 
wish to retract the Letter. We deeply regret this error and apologize to 


our scientific colleagues. 
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Methods 


Synthesis of the IHA ligand 

O-tritylhydroxylamine was synthesized as previously described. 
Chloroacetyl chloride (0.58 ml, 7.3 mmol) was dissolved in 2 ml CH,Cl, 
and added dropwise to a suspension of O-tritylhydroxylamine (2.0 g, 
7.3 mmol) and N,N-diisopropylethylamine (2.5 ml, 14.5 mmol) in 
15mlCH,Cl, at 0 °C. The reaction mixture was gradually warmedto room 
temperature and stirred at room temperature for an hour. An additional 
15mICH,Cl,wasaddedandthereaction was extracted withH,0(3x30ml). 
The CH,CI, solution was collected and evaporated to dryness. A solu- 
tion containing 15 ml of CHCl, with 10% (v/v) trifluoroacetic acid was 
added and the solution was stirred for 30 min. The crude product was 
purified by silica gel chromatography using a gradient of O-100% ethyl 
acetate in hexanes as the eluent. The product was visualized using 
a FeCl, stain. Yield, 55%. Measured molecular weight (m/z): 108.37 
[M — H*]; calculated: 107.99 [M — H*].'H NMR: (300 MHz, DMSO-d,) 
610.88 (s, 1H), 6 9.15 (s, 1H), 83.93 (s, 2H). °C NMR: (500 MHz, DMSO-d,) 
6 162.88, 6 40.45. 2-chloro-N-hydroxyacetamide (400 mg, 3.7 mmol) 
and Nal (2.7 g, 18.3 mmol) were refluxed in 30 ml acetone for 1h. The 
reaction mixture was purified by silica gel chromatography with 100% 
ethyl acetate as the eluent and dried in vacuo. Yield, >90%. Measured 
molecular weight (m/z): 223.85 [M + Na*]; calculated: 223.95 [M + Na*]. 
'H NMR: (300 MHz, DMSO-d,) 6 10.81 (s, 1H), 6 9.09 (s, 1H), 6 3.51 
(s, 2H). °C NMR: (500 MHz, DMSO-d,) 6 164.83, 6 -2.01. 


Protein expression and purification 

All constructs (Supplementary Table 1) were derived from the par- 
ent pET-20b(+) plasmid containing the CFMC1 gene via site-directed 
mutagenesis as previously described”***. The appropriate plasmids 
were transformed into BL21(DE3) Escherichia coli cells (New England 
Biolabs) housing aCCM (cytochrome C maturation) cassette contain- 
ing achloramphenicol-resistance marker and expressed as previously 
described® with minor adjustments. Multiple 2.8-1 flasks containing 
1.51 of LB medium were shaken at 200 rpm for 12 hat 37 °C and then at 
100 rpm for an additional period of around 7 h. Cells were collected 
by centrifugation (5,000 rpm for 10 min at 4 °C), resuspended ina 
buffered solution containing 5 mM sodium acetate (NaOAc) (pH 5.0) 
and 2 mM dithiothreitol (DTT) and lysed via sonication. The pH of the 
crude lysate was first raised to 10 using NaOH to precipitate cellular 
contaminants, then reduced to pH 4.5. After centrifugation (12,000 rpm 
for 20 min at 4 °C), the clarified supernatant was decanted and 
diluted 15-fold with additional buffer. This solution was applied to a 
CM sepharose gravity column (GE Healthcare) pre-equilibrated with 
the aforementioned buffer and subjected to multiple buffer washes 
before elution using a stepwise-gradient of NaCl (0-0.5M). Peak elution 
fractions were combined and concentrated using a 400-ml Amicon 
Stirred Cell (Millipore) and buffer-exchanged by overnight dialysis 
against a buffered solution containing 10 mM phosphate (pH 8.0) at 
4 °C. Next, the protein was purified viaa DuoFlow workstation station 
fitted with a Macroprep High Q-cartridge column (BioRad) and eluted 
using a linear gradient over O-0.5 M NaCl. Fractions that exhibited an 
RZ ratio (A4))/A80) > 4.4 were pooled, treated with 2 mM EDTA for Lh, 
concentrated, and buffer-exchanged into 20 mM tris(hydroxymethyl) 
aminomethane (Tris) (pH 7.5) pretreated with Chelex 100 resin (Bio- 
Rad), via desalting column (Econo-Pac 10DG pre-packed columns, 
BioRad). Demetallated and purified proteins were concentrated to 
around 2 mM and stored at 4 °C. 


Protein labelling and post-labelling purification 

Purified protein solutions were treated with a 100-fold excess of DTT 
and placed in an anaerobic Coy chamber for approximately 2 h for 
slow degassing to remove dissolved oxygen. The fully reduced protein 
solution was buffer-exchanged into 20 mM 4-(2-hydroxyethyl)-1-pip- 
erazineethanesulfonic acid (HEPES) (pH 7.5) via desalting column to 


remove DTT, and the concentration of the resulting protein solution was 
determined spectroscopically (Agilent 8452 spectrophotometer) using 
the £451 req) = 162,000 M ‘cm? (ref. **). Solid iodohydroxamic acid (IHA) 
was dissolved in 100 pl degassed DMF to generate solutions containing a 
15-fold excess IHA per protein monomer, which were then added to pro- 
tein aliquots and incubated overnight. The HA-functionalized variants 
were removed from the Coy chamber and separated from unreacted 
or partially reacted protein via FPLC using a Q-column equilibrated 
with 10 mM N-cyclohexyl-2-aminoethanesulfonic acid (CHES) (pH 9.3) 
and 2 mM DTT and eluted using a linear gradient over O-0.5 M NaCl. 
Protein functionalization was verified using electrospray ionization 
mass spectrometry (ESI-MS; Extended Data Fig. 1) and the resulting 
protein solutions were buffer-exchanged into demetallated 20 mM 
Tris (pH 7.5) via desalting column, concentrated to around 2 mM and 
stored at 4 °C for further use. 


Redesign of CFMC1 interfaces 

To render the CFMC1 protomer competent for the bimetallic design 
strategy, we first performed the following mutations to remove poten- 
tial competitive interactions: C67E, H59S and H73N. A negative design 
strategy was then used to disrupt a noncovalent dimerization inter- 
face found in CFMC1, leading to the mutations A34Q and A38Q. We 
further identified the dearth of protein-protein interactions within 
the core and periphery of the three-fold axis engulfing the 82 posi- 
tion asa likely contributor to poor cage assembly and crystallization 
in general. Accordingly, as a means to facilitate cage formation, we 
adopted Rosetta-prescribed mutations at the following positions: 
A24T, Q25(T/E), N8OK and E81Q. 


Crystallography 

Screening and crystallization of all BMC variants were conducted via 
sitting drop vapour diffusion. In brief, solutions containing 2.1-2.2 mM 
BMC protomer were mixed with mother liquor (1 p11 +1 pl) and equili- 
brated against 200-l reservoir volumes. Supplementary Table 2 details 
the experimental conditions for crystal growth. Protein solutions of 
BMCland BMC4 were first incubated with FeSO, for 1h before mixing 
with ZnCl,. Solutions of BMC2 and BMC3 were mixed with FeSO, and 
ZnCl, stock solutions and were immediately combined with the mother 
liquor (to prevent rapid aggregation of the proteins functionalized with 
two HA units). Crystals for all mutants typically appeared within several 
hours and were collected within a week of maturation. Crystals were 
cryoprotected by submersion into perfluoropolyether cryo oil (Hamp- 
ton Research) for a few seconds and flash-frozen in liquid nitrogen. X-ray 
diffraction data were collected at 100 K at either the Advanced Light 
Source (ALS) beamline BL 8.3.1 (using 1.12 A radiation for BMC3 and 
1.33 Aradiation for BMC4) or at the Stanford Synchrotron Radiation 
Lightsource (SSRL) beamlines 9-2 (using 0.98 A radiation for BMC2) 
and 12-2 (using 0.98 A radiation for BMC1). Data integration was per- 
formed using the XDS Program Package, truncated at CC, >0.5 (ref. **). 
Datasets of the same structure recorded at different wavelengths were 
scaled to the highest resolution dataset with XSCALE*””’, Phaser-MR” 
was used to carry out molecular replacement with search models based 
onthe CMFC1 monomer (PDBID:3M4B) containing the expected side 
chain mutations (generated in Pymol*°) but lacking HA. Rigid-body and 
structure refinement was performed using multiple rounds of Phenix. 
refine’, interspersed with manual model rebuilding and metal/ligand 
placement with Coot*.. Restraint files for the Cys-hydroxamic acid con- 
jugates were generated using phenix.eLBOW to maintain the distances 
Cys-SG-HA-Cl (1.816 A+ 0.02 A) and angles Cys-CB-Cys-SG-HA-Clas 
well as Cys-SG-HA-C1-HA-C2 (both 109° + 3°) during refinement. Where 
necessary, the metal binding geometry of the hydroxamic acids was 
restrained to the distances Fe-HA-O1 (1.98 A + 0.05 A) and Fe-HA-O2 
(2.057 A +0.05A) as well as through a planarity constraint for the atoms 
Fe, HA-O1, HA-O2 and HA-C1 following data from a high-resolution 
structure of Fe(1I1)-tris-benzhydroxamate trihydrate”. Simulated 
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annealing omit maps (metal atoms and side chain ligands) were gen- 
erated for each metal binding site and model accuracy was assessed 
critically against these omit maps. Electron density maps were gener- 
ated using Phenix and all molecular graphics images were produced 
with either Pymol or the UCSF ChimeraX package from the Computer 
Graphics Laboratory, University of California, San Francisco”. 


Crystallographic metal content analysis 

Metalions, with their relatively high-energy inner electrons, canabsorb 
and resonate with soft X-rays; this leads, among other effects, to differ- 
ences in the intensity of otherwise centro-symmetric Bragg diffraction 
peaks used for X-ray crystallography. Density maps calculated from 
these differences are routinely used to locate and identify metalionsin 
proteincrystals. The magnitude of this anomalous X-ray diffraction var- 
ies with the X-ray energy, with stark differences around the energies of 
the K-andL-shell electrons ofthe respective elements allowing oneto dis- 
cern between elements at a position in question, if diffraction datasets 
are measured at the appropriate wavelengths. For a visual analysis of the 
bound metals, the scaled datasets of different wavelengths were used 
separately as an input for a single phenix.refine run, each with the final 
model of the highest resolution dataset. Importantly, only the B-factor 
or occupancy were allowed to change during refinement, resulting in 
anomalous difference density maps for each wavelength. Using these 
maps, isomorphous difference maps from data at wavelengths above 
and belowthe respective element K-edges were generated (if applica- 
ble) with Phenix and were inspected manually (Extended Data Fig. 5). 
To gaina more quantitative understanding of the identity of the bound 
metals for each site, the anomalous difference signal of each dataset 
was used to generate CCP4 format maps with phenix.mtz2map. The 
generated maps were used subsequently as inputs to calculate the 
mean signal ina sphere of 1A radius centred on each metal atom with 
the program MAPMAN (Uppsala Software Factory). For each pair of 
datasets above and below a metal-absorption edge, the ratio of the 
anomalous signal above and below the edge for every metal atom was 
tabulated. The experimental ratio was compared to the theoretical 
ratio for both Fe and Zn (Extended Data Fig. 5) according to http:// 
skuld.bmsc.washington.edu/scatter, as calculated using the Cromer 
and Liberman approximation. Theoretical ratios were also calculated 
for hypothetical mixed occupancy Fe/Zn metal sites and compared to 
experimentally observed values (Supplementary Tables 3-6). 


Protein cage sample preparation 

Self-assembled cages. All samples were prepared inalow-O, atmos- 
phere (Coy glovebox) to minimize undesired oxidation of Fe** ions 
before self-assembly. Protein solutions containing 20 tM BMC3 or 
100 1M BMC4 in 20 mM Tris (pH 8.5) were incubated with either 20 1M 
FeSO, and 60 uM ZnCl, for BMC3 or 50 pM FeSO, and 200 uM ZnCl, for 
BMC4 for 2-3 hto yield the metallated cages. We note that the addition 
of FeSO, was followed by a small but observable change in the colour 
of the solution from red to pink, attributed to a shift of the haem Soret 
band to longer wavelengths, which suggested reduction of the haem by 
the ferrous ions and generation of ferric ions in close proximity to HA 
group(s). The final BMC3 solutions were then concentrated sevenfold 
before overnight incubation to improve the total cage yield. After self- 
assembly, the resulting solutions were diluted back to their original 
concentrations with the self-assembly buffer before characterization. 


Dissolved crystals. Fe:Zn:BMC1 and Fe:Zn:BMC2 crystals were dis- 
solved using buffer containing 100 mM HEPES (pH 7.5), 200 mM MgCl, 
and 800 uM ZnCl,. Mature crystals were removed from their pedestal 
droplet, briefly submerged in fresh buffer to remove uncrystallized pro- 
tein and surface-bound precipitates, and transferred into a new sitting 
drop crystallization well containing 8 pl buffer solution. The crystals 
were physically crushed with a small metal scalpel and vigorously pipet- 
ted until a large portion of the crystals dissolved. Undissolved crystals 


were removed by centrifugation (10,000 rpm for 5 min at 25 °C), yielding 
alight-red supernatant and dark-red precipitate. 


Negative-stain transmission electron microscopy 

A 4-pl droplet of BMC cages (either self-assembled or from dissolved 
crystals) was deposited onto formvar/carbon-coated Cu grids (Ted 
Pella) (pretreated by negative-mode glow discharge up to 15 min before- 
hand) and allowed to bind for 5 min. The grids were then washed with 
50 pl MilliQ water, blotted using Whatman filter paper and stained 
using 2% uranyl acetate solution in water and blotted again. Grids were 
imaged using a FEI Sphera transmission electron microscope operating 
at 200 keV, equipped with an LaB, filament and a Gatan 4K CCD camera. 
Micrographs were collected using objective-lens underfocus settings 
ranging from 250 nmto 2 umand analysed using Fiji (http://fiji.sc/Fiji). 


Oligomerization state determination using AUC 

Sedimentation velocity measurements were performed at 41,000 rpm 
and 25 °C using an XL-1 analytical ultracentrifuge (Beckman Coulter) 
equipped with an AN-60 Ti rotor. Data processing was performed using 
Sedfit* with the following parameters as calculated using SEDNTERP: 
viscosity: 0.01000 poise, density: 0.9988 g/ml (self-assembled samples) 
or viscosity: 0.0113191 poise, density: 1.0196 g/ml (dissolved crystals), 
and a partial specific volume of 0.7313 ml/g for all samples. All reported 
results correspond to a confidence level of 0.95. 


Preparation of samples involving crystal dissolution 

Dissolved crystal samples (BMC1 and BMC2), prepared as described 
above at ambient conditions, were diluted to 350 pl with 10 mM HEPES 
(pH 7.5), 200 mM MgCl, and 800 uM ZnCl. The solution was clari- 
fied via brief centrifugation in order to remove crystal debris and the 
supernatant was placed inside the cells. 


Calculation of BMC void volumes 

Structures of complete cage assemblies for BMC2, BMC3 and BMC4 
were generated via the application of crystallographic symmetry 
operations to the fully refined asymmetric unit of each construct. 
These coordinates were recentred at the origin and stripped of waters, 
hydrogens, alternative conformations and crystallization reagents 
(PEG-400). Volumetric maps and volumes for the internal cavity of 
each cage were calculated using VOIDOO*, and are reported as the 
solvent-accessible volume for a 1.4 A rolling probe ona 0.25 A grid 
spacing for all constructs. The cavity volumes using these parameters 
were determined to be approximately 32,700 A? (BMC2), 32,700 A® 
(BMC3) and 7,900 A? (BMC4). 


Solution self-assembly, disassembly and thermal stability of 
BMC3 and BMC4 

Assembled samples were prepared as described above and placed 
inside the AUC measurement cells anaerobically (20 pM BMC3 and 
100 tM BMC4). Disassembly of the cages via metal-ion removal was 
performed by treating the protein cages with 2 mM EDTA for 1h. Redox- 
controlled disassembly of the protein commenced by the addition of 
either 5 mM sodium dithionite or 5 mM sodium ascorbate to the cage 
solution anaerobically and subsequent incubation of the samples at 
around 22 °C for 16 h. Samples were then loaded into the AUC meas- 
urement cell. 

For thermal stability measurements, samples were placed ina ther- 
moregulated chamber preequilibrated at the appropriate temperature 
for 2h, and subsequently removed from the chamber and equilibrated 
at room temperature for 30 min before AUC analysis. Circular dichroism 
(CD) spectra were measured using an Aviv 215 spectrometer. CD meas- 
urements were performed using 10 uM protein ina buffered solution 
containing 20 mM Tris (pH 8.5). Thermal melts were measured at 222nm 
atalnmslit width, scanning at 1-nm intervals with a1-s integration time. 
Measurements were taken from 25 °C to 85 °C at 2-degree intervals 


with a2 min equilibration at each temperature. Unfolding data were 
fit to a two-state model with van’t Hoff’s enthalpy using the CalFitter 
web server**. 


Cryo-EM sample preparation 

Self-assembled BMC3 cages were removed from the anaerobic Coy 
chamber immediately before grid preparation. A 3.5-p1l aliquot of self- 
assembled BMC3 cages was dropped onto holey carbon grids (Electron 
Microscopy Sciences, Quantifoil R1.2/1.3 holey carbon on 300 mesh 
copper) that had been freshly glow-discharged for 30 s. The initial 
application of the sample was side blotted manually with Whatman 
No. 1 filter paper immediately followed by a secondary application 
of a3.5-ul aliquot, blotted for 3.5 s and plunge-frozen in liquid ethane 
cooled by liquid nitrogen using a Vitrobot Mark IV (FEI). 


Cryo-EM data acquisition and image processing 

Samples were imaged on a Titan Krios G3 transmission electron 
microscope (FEI) operating at 300 kV equipped with a K2 Summit 
direct electron detector (Gatan) and a GIF Quantum energy filter. 
The slit-width of the energy filter was set to 10 eV. Movies were 
collected at a magnification of 165,000 in EFTEM mode giving 
a physical pixel size of 0.84 A/pixel. In total, 4,672 movie stacks 
(50 frames/movie) were collected using a10 s exposure at a dose rate 
of 1.2 e-/A? per frame for a total electron dose of 60 e/A” per movie. 
Objective-lens underfocus settings varied between 0.6 mand 1.6m. 
Data collection was performed using software EPU (FEI). All image 
processing was performed in the Relion-3.0 pipeline”. Motion cor- 
rection and dose weighting were performed using MotionCor2‘8, 
and defocus values were estimated with Gctf* using a pixel size of 
0.8 A/pixel. A total of 3,513 movie stacks were selected following 
motion correction and CTF estimation, and 805,156 particles were 
auto-picked using RELION-3.0. Particle images were extracted and 
binned by 2 (1.6 A/pixel, 100 pixel box size) and subjected to two- 
dimensional (2D) classification. A total of 444,247 particles were 
selected corresponding to good 2D class averages and subjected 
to three-dimensional (3D) classification imposing 7symmetry and 
using an initial model generated from a subset of the particles. A 
total of 129,653 particles were chosen froma 3D class showing strong 
secondary-structural elements and subjected to 3D auto-refinement 
with Tsymmetry. The particles were re-centred and re-extracted to 
their original pixel size of 0.8 A/pixel. These particles were subjected 
to 3D auto-refinement with Tsymmetry and the yield map was then 
postprocessed towards 2.6 A resolution based on the gold-standard 
Fourier shell correlation (FSC) 0.143 criterion. The pixel size of the 
map was manually adjusted using Relion image handler to match the 
physical pixel size of the images. Local resolution was calculated in 
Relion 3.0 using ResMap™. 


Model building and refinement 

The BMC3 crystal structure (PDB ID: 60T7) stripped of hydrogens 
and waters was used as an initial model and manually docked into 
the cryo-EM density using UCSF Chimera*". The structural model 
was subject to real space refinement in Phenix against the cryo-EM 
map with geometry restraints for the Fe-binding sites and molecular 
coordinates for the Cys—HA ligand. The atomic model was manually 
improved using Coot. Tightly bound waters were identified based 
on clear density in the EM density map. Whereas the structural flex- 
ibility of the hydroxamate sites manifested in poor electron density, 
the twofold interface was much more rigid and unambiguous density 
was observed for Zn-binding. A tryptophan at the 66 position, which 
had shown high-temperature factors in the BMC3 crystal structure, 
was identified in multiple conformations in the EM density map. The 
final model was subjected to real space refinement using Phenix” and 
evaluated using MolProbity”. All molecular graphics images were 
rendered in PyMoL or UCSF ChimeraxX. 


Encapsulation of rhodamine in BMC3 cages 

BMC3 cages were self-assembled in a low-O, atmosphere in the pres- 
ence of rhodamine for the passive encapsulation of the dye. Solutions 
containing 20 1M BMC3 were incubated with 20 1M FeSO,, 60 uM ZnCl, 
and 2 mM rhodamine. A control sample was prepared in the absence 
of added metal ions (20 uM BMC3 incubated with 2 mM rhodamine). 
Samples were incubated for 2-3 hand concentrated sevenfold before 
overnight incubation. Protein solutions were buffer exchanged ona 
PD-10 desalting column using a buffer containing 20 mM Tris (pH 8.5) 
(with 5 uM FeSO, and 10 pM ZnCl, supplemented for solutions already 
containing metal ions) to separate unassociated dye from protein. Cage 
solutions were split in two: one half was treated with 1 mM EDTA and 
incubated for 2 h before washing. All protein solutions were addition- 
ally washed three times using a centrifugal filter to completely remove 
any remaining free rhodamine. 

Fluorescence measurements were performed using 6 1M protein 
solutions after the previously mentioned wash steps. For eachsample, 
an excitation wavelength of 555 nm with a2 nm slit width was used and 
emission was measured between 560 and 650 nm witha2 nm slit width 
and 0.2 s integration time. For the time-course experiments, cages 
encapsulating rhodamine were washed three times after 4 days and after 
7 days and diluted to 6 uM before fluorescence measurements. AUC 
measurements were performed at the A,,,, of the cytochrome (415 nm) 
and at the A,,,,, of rhodamine (555 nm) to assess whether there was a 
sufficiently large rhodamine signal associated with BMC3 cages. Ultra- 
violet—visible light (UV-vis) absorbance measurements were performed 
oneach solution to measure the protein and rhodamine concentrations. 
Difference spectra were taken between each rhodamine-incubated 
sample and BMC3 protomer to eliminate any background signal. 


Statistics and reproducibility 

All reported samples represent technical replicates. The ns-TEM 
micrograph of BMC2 cages after 3D crystal dissolution (Fig. 2a) is 
representative of experiments repeated independently four times. 
AUC experiments for BMC2 (Fig. 2b) were performed in duplicate. Self- 
assembly of BMC3 cages and subsequent AUC characterization (Fig. 3a) 
were performed the following number of times: BMC3 protomer 
(n= 2), +Fe** (n= 4), +Zn”* (n=4), +Fe’, +Zn”* (n= 6). Self-assembly of 
BMC4 cages and subsequent AUC characterization (Fig. 4a) was per- 
formed the following number of times: BMC4 protomer (n = 2), +Fe** 
(n=1), +Zn** (n=1), +Fe?, +Zn** (n=5). Mass spectra (Extended Data 
Fig. 1c-f) were collected in duplicate for native and HA-labelled pro- 
teins; AUC experiments were performed in duplicate. TEM charac- 
terization of BMC constructs (Extended Data Fig. 3) were performed 
the following number of times: dissolved BMC1 crystals (n = 1), dis- 
solved BMC2 crystals (n = 4), BMC2 +EDTA (n = 2), self-assembled 
BMC3 cages (n= 5), BMC3 +EDTA (n= 4). AUC experiments following 
the incubation of BMC3 with first-row transition metals (Extended 
Data Fig. 6a) were performed in duplicate. Self-assembly of BMC3 
in the presence of Fe(acetylacetonate), (Extended Data Fig. 6b) was 
performed in duplicate. BMC3 cage disassembly in the presence of 
EDTA (Extended Data Fig. 6c) was performed in triplicate. AUC char- 
acterization of BMC variants after equilibration at different tempera- 
tures (Extended Data Fig. 6d) was performed the following number 
of times: BMC3 at 50 °C (n= 2), BMC3 at 70 °C (n= 2), BMC4 at 50 °C 
(n=3), BMC4 at 70 °C (n=3), BMC4 at 90 °C (n=3). Thermal unfolding of 
BMC variants as measured by CD spectroscopy (Extended Data Fig. 6d) 
was performed in duplicate. Treatment of BMC3 cages with chemical 
reductants (Extended Data Fig. 6e) was performed in duplicate. Cryo- 
EM characterization of BMC3 cages was performed after collecting 
4,672 movie stacks. Extended Data Figure 7a shows a representative 
micrograph and three representative 2D class averages. Fluorescence 
characterization of BMC3 samples incubated with rhodamine were 
performed (Extended Data Fig. 8a) in triplicate. AUC characterization 
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of BMC3 cages encapsulating rhodamine (Extended Data Fig. 8b) was 
performed in duplicate. UV-vis characterization of BMC3 samples 
incubated with rhodamine (Extended Data Fig. 8c, d) was performed 
in triplicate. Repeated fluorescence characterization of a solution con- 
taining BMC3 cages encapsulating rhodamine (Extended Data Fig. 8e) 
was performed in duplicate. 


Reporting summary 
Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 
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and 6OVH (BMC3 cryo-EM) in the PDB or EMD-20212 at The Electron 
Microscopy Data Bank. 
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Extended Data Fig. 1| Characterization of the IHA ligand and the BMC 
constructs. a,b, NMR spectra of N-hydroxy-2-iodoacetamide in 

DMSO-d, for'H (a) and “C (b). c-f, ESI-MS of as-isolated and HA-functionalized 
BMC constructs, and AUC profiles of HA-functionalized protomers for BMC1 


(c), BMC2 (d), BMC3 (e) and BMC4 (f). The calculated masses for each 
unlabelled protein are determined by summing the mass of the polypeptide 
sequence and thec-type haem (618 Da) covalently linked to the cytochrome. 
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Extended Data Fig. 2| Structural comparison of CFMC1and BMC1 cages. mutations). c,d, Comparison of C,and C;symmetric interfaces and 
a, Thesymmetric substructures of the CFMC1 dodecameric unit and its corresponding metal binding sites for CFMC1 (c) and BMC1 (d). Full cages are 
per-protomer SASA and BSA values. Associative surfaces on the protomers are shownas surfaces; insets show details of each interface. Fe and Znions are 
coloured red for homologous interactions and red/orange or blue/cyan for represented as orange and teal spheres, respectively. e, Cartoon 
heterologous interactions (right). b, Summary of engineered metal- representation of a full-size BMC1 cage with all metal ions shownas spheres. 


coordination motifs for BMC constructs (see Supplementary Table 1 for all PDBID:3M4B (CFMC1), 60T9 (BMC1). 


Dissolved BMC1 crystals 


Dissolved BMC2 crystals BMC2 +EDTA 


Self-assembled BMC3 cages BMC3 +EDTA 


Extended Data Fig. 3| ns-TEM characterization of BMC constructs. 

a,b, Dissolved Fe:Zn:BMC1 (a) and Fe:Zn:BMC2 crystals (b) ina buffer 
containing 100 mM HEPES (pH 7.5), 200 mM MgCl, and 800 uM ZnCl.,. c, Self- 
assembled Fe:Zn:BMC3 cages ina buffer containing 20 mM Tris (pH 8.5), 
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20 uM FeSO, and 60 uM ZnCl,. Histograms inb, c reflect the size distributions 
of Fe:Zn:BMC2 and Fe:Zn:BMC3 cage diameters as measured from ns-TEM 
images. Gaussian fits to both distributions are drawnas solid lines along with 
their centres and standard deviations reported. Scale bars, 50 nm. 
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Extended Data Fig. 4| Cavity volumes of BMC cages. Solvent-accessiblecavity | shownasyellowspheres in Figs. 2,4, are reproduced for comparison to the 
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Extended Data Fig. 5 | Anomalous densities of engineered metal binding 
sites and conformational flexibility of Cys82-HA site. a—j, Cartoon and stick 
representations of the symmetric interfaces of BMC1 (a, b), BMC2 (c-e), BMC3 
(f-h) and BMC4 (i,j) showing the engineered metal binding sites with the C63- 
HA ligands (a, c, f), C82-HA ligands (d, g, i) and Zn binding sites (b, e, h,j). The 
difference inthe anomalous signal between pairs of datasets above and below 
the K-shell energy of Znand Fe, respectively, are depicted as blue or orange 
meshes. Astrong signal illustrates a strong change in anomalous signal across 
the respective edge, in turn suggesting the presence of the respective metal. 
The top right corner of each panel indicates the energies of the datasets used 
for the map of the respective colour. All anomalous difference maps are 
contoured at 3o. As datasets around the Fe-edge were not available for BMC1 
and BMC3 (necessitating calculations using anomalous difference density of 
singular datasets), the calculated” values for Zn at 7.3 and 9.3 keV are 0.82 and 
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selective Zn loading. For amore quantitative analysis of the nature of the bound 
metal, ratios of the anomalous signal to the expected values (bottom left 
corner of each panel) were calculated as described inthe Methods. k, Stick 
representation of the BMC2 Cys82-HA binding site in both alternative 
conformations with the anomalous difference density over the Fe-edge shown 
as orange mesh and asimulated annealing omit map (omitting all C82-HA 
atoms and Fe) of the normal electron density as light blue mesh contoured at 
20. For all Cys—HA binding sites, arrows indicate the handedness of the binding 
site as A (right handed) or A (left handed). The reversion of handedness in 

k with the respective view angle is indicated by arrows. Colour code for atoms 
inall panels: Fe in orange, Znin blue, Sin yellow, O in red and N in dark blue. 
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Extended Data Fig. 6 | Solution characterization of self-assembled BMC3 
and BMC4 cages. a-c, The oligomerization state of BMC3 cages as monitored 
by AUC measurements following incubation with various first-row transition 
metal ions (a), incubation with Zn** and Fe** (Fe(acetylacetonate),) (b) and 
disassembly via sequestration of metal ions by EDTA (c).d, AUC profiles of BMC 
variants after equilibration for 2h at the indicated temperatures (top). Thermal 


BMC3 cages +Ascorbate 


BMC3 cages +Dithionite 


unfolding of BMC variants as measured by circular dichroism spectroscopy 

at 222 nm (bottom).e, AUC profiles of BMC following treatment with 

chemical reductants of different reduction potentials (left). ns-TEM 
micrographs (middle and right) are shown for cage samples incubated with the 
corresponding chemical reductants. 
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Extended Data Fig. 7| Cryo-EM analysis of BMC3 cages. a, Representative 
cryo-EM micrograph and 2D class averages. b, Flowchart detailing image 
processing from collected movie stacks to final map. Additional details canbe 
found inthe Methods. c, FSC curves calculated between the half-maps (black 
line), atomic model to the unmasked full map (purple line) and atomic model to 


Pane ant 


the masked full map (blue line). Resolution values are indicated at the gold- 
standard FSC 0.143 criterion. d, Local resolution estimates of the final 
reconstruction calculated using ResMap. e, Electron density shown at BMC3 C, 
interfaces highlighting poorly resolved density (reflecting high flexibility) at 
hydroxamate sites and multiple conformations of W66. 
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Extended Data Fig. 8 | Encapsulation of rhodamine inside BMC3 cages. 

a, Fluorescence characterization of BMC3 samples incubated with rhodamine. 
Cages encapsulating rhodamine were treated with EDTA and washed before 
measuring fluorescence intensity. b, AUC profiles of cages encapsulating 
rhodamine monitored at the haem Soret absorption maximum (A,,,, =415 nm) 
and rhodamine absorption maximum (A,,,,=555nm).c, UV-vis characterization 


of BMC3 samples incubated with rhodamine. d, Difference spectra of BMC3 
samples and BMC3 protomer shown inc. Free rhodamine dissolved in buffer is 
shownas dark-red dashes. e, Repeated fluorescence characterization ofa 
solution containing BMC3 cages encapsulating rhodamine over several days. 
The sample was washed three times before each fluorescence measurement. 


Extended Data Table 1| X-ray data collection, processing and refinement statistics 


Data collection 
Space group 
Cell dimensions 

a, b, c(A) 

a, B, ¥ (°) 
Resolution (A) 


No. Reflections Observed 
No. Reflections Unique 
R 


merge 
I/ol 

CC1/2 
Completeness (%) 
Redundancy 


Wilson B (A)? 


Refinement 


Resolution (A) 


No. reflections 
Rurortd Rice 
No. atoms 

Protein 

Ligand/ion 

Water 
B-factors (A)? 

Protein 

Ligand/ion 

Water 
R.m.s. deviations 

Bond lengths (A) 

Bond angles (°) 
Clashscore 
Ramachandran favored (%) 
Ramachandran allowed (%) 
Ramachandran outliers (%) 


Rotamer outliers (%) 


BMC1 


R32 


125.6, 125.6, 166.4 


90, 90, 120 


39.92 — 2.40 
(2.46 — 2.40) 


393414 (28956) 
38245 (2875) 
0.185 (2.986) 

8.1 (0.96) 
0.998 (0.501) 

99.81 (100.00) 

10.29 (10.01) 
52 


36.37 - 2.40 
(2.43 — 2.40) 


38194 (1286) 
0.2174/0.2718 


3292 
200 


Numbers in parentheses correspond to the highest-resolution shell. 


BMC2 


R32 


126.1, 126.1, 168.2 


90, 90, 120 


39.25 — 1.40 
(1.44 — 1.40) 


1863896 (88779) 
195424 (14465) 
0.056 (1.830) 
20.3 (0.84) 
1.000 (0.410) 
99.37 (98.9) 
9.54 (6.138) 
19 


31.54 - 1.40 
(1.42 - 1.40) 


195270 (8377) 
0.1659/0.1909 


3502 
244 
786 


BMC3 


R32 


126.7, 126.7, 167.8 


90, 90, 120 


91.83 — 1.85 
(1.90 — 1.85) 


885564 (66924) 
85783 (6342) 
0.096 (2.377) 

13.7 (1.04) 
0.999 (0.468) 
100.00 (100.00) 
10.32 (10.54) 
36 


91.83 - 1.85 
(1.88 - 1.85) 


85755 (3974) 
0.1826/0.2106 


3459 
240 
342 


BMC4 


P6,22 


87, 87, 63.3 
90, 90, 120 


48.47 — 1.50 
(1.54 — 1.50) 


734764 (18399) 
37652 (1334) 
0.053 (1.936) 

26.6 (1.23) 
1.000 (0.480) 
87.66 (42.2) 
19.51 (13.94) 

29 


48.47 - 1.50 
(1.53 - 1.50) 


37642 (810) 
0.1900/0.2181 


925 


124 
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Extended Data Table 2 | Cryo-EM data collection, processing, and refinement statistics 


BMC3 
(EMDB ID: EMD-20212) 
(PDB ID: 6OVH) 


Data collection and processing 


Magnification 165,000x 
Voltage (kV) 300 
Electron dose (e7/A2) 60 
Exposure rate (e7/ A2/s) 6 
Defocus range (um) 0.84 
Pixel size (A) 0.3-2.7 
Symmetry imposed T 
Total extracted particles (no.) 805,156 
Final refined particles (no.) 25,391 
Map resolution (A) 2.57 
FSC 0.143 (unmasked/masked) 2.62/2.57 
Map resolution range (A) oo— 2.57 
Applied B-factor (A2) -79 
Refinement 
Initial model used (PDB code) 60T7 
Model resolution (A) 2.57 
FSC 0.5 (unmasked/masked) 2.74/2.72 
FSC 0.143 (unmasked/masked) 2.62/2.57 
Model resolution range (A) co— 2.57 
Map sharpening B factor (A?) -79 


Model composition 


Non-hydrogen atoms 10895 
Protein residues 1272 
Ligand/ion 68 
Water 171 


B-factors (A)? 


Protein 51 
Ligand/ion 59 
Water 48 


R.m.s. deviations 


Bond lengths (A) 0.007 

Bond angles (°) 0.876 
Validation 

MolProbity score 1.33 

Clashscore 1.15 

Rotamer outliers (%) 4.55 


Ramachandran plot 
Favored (%) 99.04 
Allowed (%) 0.96 
Disallowed (%) 0.00 
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Only common tests should be described solely by name; describe more complex techniques in the Methods section. 


*< 


A description of all covariates tested 


LOU 


x A description of any assumptions or corrections, such as tests of normality and adjustment for multiple comparisons 
r Ol A full description of the statistical parameters including central tendency (e.g. means) or other basic estimates (e.g. regression coefficient) 
4 AND variation (e.g. standard deviation) or associated estimates of uncertainty (e.g. confidence intervals) 
OQ For null hypothesis testing, the test statistic (e.g. F, t, r) with confidence intervals, effect sizes, degrees of freedom and P value noted 
a Give P values as exact values whenever suitable. 
x For Bayesian analysis, information on the choice of priors and Markov chain Monte Carlo settings 
x For hierarchical and complex designs, identification of the appropriate level for tests and full reporting of outcomes 
x Estimates of effect sizes (e.g. Cohen's d, Pearson's r), indicating how they were calculated 


Our web collection on statistics for biologists contains articles on many of the points above. 


Software and code 
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Data collection X-ray crystallography: Custom software at SSRL and ALS 
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We strongly encourage code deposition in a community repository (e.g. GitHub). See the Nature Research guidelines for submitting code & software for further information. 
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- Accession codes, unique identifiers, or web links for publicly available datasets 
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Field-specific reporting 
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Data exclusions No data was excluded 
Replication All results from the experiments were successfully replicated. 
Randomization Norandomization was performed as it is not relevant to the current study. 


Blinding No blinding was performed as it is not relevant to the current study. 


Behavioural & social sciences study design 


All studies must disclose on these points even when the disclosure is negative. 


Study description Briefly describe the study type including whether data are quantitative, qualitative, or mixed-methods (e.g. qualitative cross-sectional, 
quantitative experimental, mixed-methods case study). 


Research sample State the research sample (e.g. Harvard university undergraduates, villagers in rural India) and provide relevant demographic information 
(e.g. age, sex) and indicate whether the sample is representative. Provide a rationale for the study sample chosen. For studies involving 
existing datasets, please describe the dataset and source. 


Sampling strategy Describe the sampling procedure (e.g. random, snowball, stratified, convenience). Describe the statistical methods that were used to 
predetermine sample size OR if no sample-size calculation was performed, describe how sample sizes were chosen and provide a rationale 
for why these sample sizes are sufficient. For qualitative data, please indicate whether data saturation was considered, and what criteria 
were used to decide that no further sampling was needed. 


Data collection Provide details about the data collection procedure, including the instruments or devices used to record the data (e.g. pen and paper, 
computer, eye tracker, video or audio equipment) whether anyone was present besides the participant(s) and the researcher, and whether 
the researcher was blind to experimental condition and/or the study hypothesis during data collection. 


Timing Indicate the start and stop dates of data collection. If there is a gap between collection periods, state the dates for each sample cohort. 


Data exclusions If no data were excluded from the analyses, state so OR if data were excluded, provide the exact number of exclusions and the rationale 
behind them, indicating whether exclusion criteria were pre-established. 


Non-participation State how many participants dropped out/declined participation and the reason(s) given OR provide response rate OR state that no 
participants dropped out/declined participation. 


Randomization If participants were not allocated into experimental groups, state so OR describe how participants were allocated to groups, and if 
allocation was not random, describe how covariates were controlled. 


Ecological, evolutionary & environmental sciences study design 


O 


py) 


All studies must disclose on these points even when the disclosure is negative. 


Study description Briefly describe the study. For quantitative data include treatment factors and interactions, design structure (e.g. factorial, nested, 
hierarchical), nature and number of experimental units and replicates. 
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Research sample Describe the research sample (e.g. a group of tagged Passer domesticus, all Stenocereus thurberi within Organ Pipe Cactus National 
Monument), and provide a rationale for the sample choice. When relevant, describe the organism taxa, source, sex, age range and 
any manipulations. State what population the sample is meant to represent when applicable. For studies involving existing datasets, 
describe the data and its source. 


Sampling strategy Note the sampling procedure. Describe the statistical methods that were used to predetermine sample size OR if no sample-size 
calculation was performed, describe how sample sizes were chosen and provide a rationale for why these sample sizes are sufficient. 


Data collection Describe the data collection procedure, including who recorded the data and how. 
Timing and spatial scale | /ndicate the start and stop dates of data collection, noting the frequency and periodicity of sampling and providing a rationale for 
these choices. If there is a gap between collection periods, state the dates for each sample cohort. Specify the spatial scale from which 


the data are taken 


Data exclusions If no data were excluded from the analyses, state so OR if data were excluded, describe the exclusions and the rationale behind them, 
indicating whether exclusion criteria were pre-established. 
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Reproducibility Describe the measures taken to verify the reproducibility of experimental findings. For each experiment, note whether any attempts to 
repeat the experiment failed OR state that all attempts to repeat the experiment were successful. 


Randomization Describe how samples/organisms/participants were allocated into groups. If allocation was not random, describe how covariates were 
controlled. If this is not relevant to your study, explain why. 


Blinding Describe the extent of blinding used during data acquisition and analysis. If blinding was not possible, describe why OR explain why 
blinding was not relevant to your study. 


Did the study involve field work? Yes No 


Field work, collection and transport 


Field conditions Describe the study conditions for field work, providing relevant parameters (e.g. temperature, rainfall). 
Location State the location of the sampling or experiment, providing relevant parameters (e.g. latitude and longitude, elevation, water 
depth). 


Access and import/export Describe the efforts you have made to access habitats and to collect and import/export your samples in a responsible manner and 
in compliance with local, national and international laws, noting any permits that were obtained (give the name of the issuing 
authority, the date of issue, and any identifying information). 


Disturbance Describe any disturbance caused by the study and how it was minimized. 


Reporting for specific materials, systems and methods 


We require information from authors about some types of materials, experimental systems and methods used in many studies. Here, indicate whether each material, 
system or method listed is relevant to your study. If you are not sure if a list item applies to your research, read the appropriate section before selecting a response. 


Materials & experimental systems Methods 
n/a | Involved in the study n/a | Involved in the study 
x Antibodies x ChIP-seq 
x Eukaryotic cell lines x Flow cytometry 
x Palaeontology x MRI-based neuroimaging 
x Animals and other organisms 
x Human research participants 
x]||[_] Clinical data 


Antibodies 


Antibodies used Describe all antibodies used in the study; as applicable, provide supplier name, catalog number, clone name, and lot number. iS) 
S 

& 

Validation Describe the validation of each primary antibody for the species and application, noting any validation statements on the iS 
manufacturer’s website, relevant citations, antibody profiles in online databases, or data provided in the manuscript. = 


Eukaryotic cell lines 


Policy information about cell lines 


Cell line source(s) State the source of each cell line used. 
Authentication Describe the authentication procedures for each cell line used OR declare that none of the cell lines used were authenticated. 
Mycoplasma contamination Confirm that all cell lines tested negative for mycoplasma contamination OR describe the results of the testing for 


mycoplasma contamination OR declare that the cell lines were not tested for mycoplasma contamination. 


Commonly misidentified lines ame any commonly misidentified cell lines used in the study and provide a rationale for their use. 
(See ICLAC register) 


Palaeontology 
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Specimen provenance Provide provenance information for specimens and describe permits that were obtained for the work (including the name of the 
issuing authority, the date of issue, and any identifying information). 


Specimen deposition Indicate where the specimens have been deposited to permit free access by other researchers. 


Dating methods If new dates are provided, describe how they were obtained (e.g. collection, storage, sample pretreatment and measurement), 


where they were obtained (i.e. lab name), the calibration program and the protocol for quality assurance OR state that no new 
dates are provided. 


Tick this box to confirm that the raw and calibrated dates are available in the paper or in Supplementary Information. 


Animals and other organisms 


Policy information about studies involving animals; ARRIVE guidelines recommended for reporting animal research 


Laboratory animals For laboratory animals, report species, strain, sex and age OR state that the study did not involve laboratory animals. 


Wild animals Provide details on animals observed in or captured in the field; report species, sex and age where possible. Describe how animals 
were caught and transported and what happened to captive animals after the study (if killed, explain why and describe method; if 
released, say where and when) OR state that the study did not involve wild animals. 


Field-collected samples For laboratory work with field-collected samples, describe all relevant parameters such as housing, maintenance, temperature, 
photoperiod and end-of-experiment protocol OR state that the study did not involve samples collected from the field. 


Ethics oversight Identify the organization(s) that approved or provided guidance on the study protocol, OR state that no ethical approval or 
guidance was required and explain why not. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Human research participants 


Policy information about studies involving human research participants 


Population characteristics Describe the covariate-relevant population characteristics of the human research participants (e.g. age, gender, genotypic 


information, past and current diagnosis and treatment categories). If you filled out the behavioural & social sciences study design 
questions and have nothing to add here, write "See above." 


Recruitment Describe how participants were recruited. Outline any potential self-selection bias or other biases that may be present and how 
these are likely to impact results. 


Ethics oversight Identify the organization(s) that approved the study protocol. 


Note that full information on the approval of the study protocol must also be provided in the manuscript. 


Clinical data 
Sy 
Policy information about clinical studies a 
All manuscripts should comply with the ICMJEguidelines for publication of clinical research and a completedCONSORT checklist must be included with all submissions. & 
No 
S 
Clinical trial registration Provide the trial registration number from ClinicalTrials.gov or an equivalent agency. = 
Study protocol Note where the full trial protocol can be accessed OR if not available, explain why. 


Data collection Describe the settings and locales of data collection, noting the time periods of recruitment and data collection. 


Outcomes Describe how you pre-defined primary and secondary outcome measures and how you assessed these measures. 


ChIP-seq 


Data deposition 


Confirm that both raw and final processed data have been deposited in a public database such as GEO. 


Confirm that you have deposited or provided access to graph files (e.g. BED files) for the called peaks. 


Data access links For "Initial submission" or "Revised version" documents, provide reviewer access links. For your "Final submission" document, 
May remain private before publication. provide a link to the deposited data. 

Files in database submission Provide a list of all files available in the database submission. 

Genome browser session Provide a link to an anonymized genome browser session for "Initial submission" and "Revised version" documents only, to 
(e.g. UCSC) 


enable peer review. Write "no longer applicable" for "Final submission" documents. 
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Methodology 

Replicates Describe the experimental replicates, specifying number, type and replicate agreement. 

Sequencing depth Describe the sequencing depth for each experiment, providing the total number of reads, uniquely mapped reads, length of 
reads and whether they were paired- or single-end. 

Antibodies Describe the antibodies used for the ChiP-seq experiments; as applicable, provide supplier name, catalog number, clone 
name, and lot number. 

Peak calling parameters Specify the command line program and parameters used for read mapping and peak calling, including the ChIP, control and 
index files used. 

Data quality Describe the methods used to ensure data quality in full detail, including how many peaks are at FDR 5% and above 5-fold 
enrichment. 

Software Describe the software used to collect and analyze the ChIP-seq data. For custom code that has been deposited into a 


community repository, provide accession details. 


Flow Cytometry 


Plots 


Confirm that: 


The axis labels state the marker and fluorochrome used (e.g. CD4-FITC). 


The axis scales are clearly visible. Include numbers along axes only for bottom left plot of group (a ‘group’ is an analysis of identical markers). 


All plots are contour plots with outliers or pseudocolor plots. 


A numerical value for number of cells or percentage (with statistics) is provided. 


Methodology 
Sample preparation Describe the sample preparation, detailing the biological source of the cells and any tissue processing steps used. 
Instrument Identify the instrument used for data collection, specifying make and model number. 
Software Describe the software used to collect and analyze the flow cytometry data. For custom code that has been deposited into a 


community repository, provide accession details. 


Cell population abundance _| Describe the abundance of the relevant cell populations within post-sort fractions, providing details on the purity of the samples 
and how it was determined. 


Gating strategy Describe the gating strategy used for all relevant experiments, specifying the preliminary FSC/SSC gates of the starting cell 
population, indicating where boundaries between "positive" and "negative" staining cell populations are defined. 


Tick this box to confirm that a figure exemplifying the gating strategy is provided in the Supplementary Information. 


810% 1290120 


Magnetic resonance imaging 


Experimental design 


Design type Indicate task or resting state; event-related or block design. 


Design specifications Specify the number of blocks, trials or experimental units per session and/or subject, and specify the length of each trial 
or block (if trials are blocked) and interval between trials. 


Behavioral performance measures State number and/or type of variables recorded (e.g. correct button press, response time) and what statistics were used 
to establish that the subjects were performing the task as expected (e.g. mean, range, and/or standard deviation across 


subjects). 
Acquisition 

Imaging type(s) Specify: functional, structural, diffusion, perfusion. 

Field strength Specify in Tesla 

Sequence & imaging parameters Specify the pulse sequence type (gradient echo, spin echo, etc.), imaging type (EPI, spiral, etc.), field of view, matrix size, 
slice thickness, orientation and TE/TR/flip angle. 

Area of acquisition State whether a whole brain scan was used OR define the area of acquisition, describing how the region was determined. 

Diffusion MRI [_] Used [_] Not used 


Preprocessing 


Preprocessing software Provide detail on software version and revision number and on specific parameters (model/functions, brain extraction, 
segmentation, smoothing kernel size, etc.). 


Normalization If data were normalized/standardized, describe the approach(es): specify linear or non-linear and define image types 
used for transformation OR indicate that data were not normalized and explain rationale for lack of normalization. 


Normalization template Describe the template used for normalization/transformation, specifying subject space or group standardized space (e.g. 
original Talairach, MNI305, ICBM152) OR indicate that the data were not normalized. 


Noise and artifact removal Describe your procedure(s) for artifact and structured noise removal, specifying motion parameters, tissue signals and 
physiological signals (heart rate, respiration). 


Volume censoring Define your software and/or method and criteria for volume censoring, and state the extent of such censoring. 


Statistical modeling & inference 


Model type and settings Specify type (mass univariate, multivariate, RSA, predictive, etc.) and describe essential details of the model at the first 
and second levels (e.g. fixed, random or mixed effects; drift or auto-correlation). 


Effect(s) tested Define precise effect in terms of the task or stimulus conditions instead of psychological concepts and indicate whether 
ANOVA or factorial designs were used. 


Specify type of analysis: Whole brain ROI-based Both 


Statistic type for inference 


Specify voxel-wise or cluster-wise and report all relevant parameters for cluster-wise methods. 
(See Eklund et al. 2016) 


Correction Describe the type of correction and how it is obtained for multiple comparisons (e.g. FWE, FDR, permutation or Monte 
Carlo). 


Models & analysis 


n/a | Involved in the study 


Functional and/or effective connectivity 


Graph analysis 


Multivariate modeling or predictive analysis 


Functional and/or effective connectivity Report the measures of dependence used and the model details (e.g. Pearson correlation, partial 
correlation, mutual information). 


Graph analysis Report the dependent variable and connectivity measure, specifying weighted graph or binarized graph, 
subject- or group-level, and the global and/or node summaries used (e.g. clustering coefficient, efficiency, 
etc.). 
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Multivariate modeling and predictive analysis Specify independent variables, features extraction and dimension reduction, model, training and evaluation 
metrics. 
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Around the globe, governments have pledged to remove support for 
coal, oil and gas, noting that such fossil fuel subsidies “undermine 
efforts to deal with climate change” by keeping greenhouse gas emis- 
sions higher than they otherwise would be’. Jewell et al.? used results 
of integrated assessment models to infer that eliminating subsidies 
would yield “limited emission reductions...except in energy-exporting 
regions”, and described the emission reduction benefits as “small”. This 
characterization is potentially misleading, and here we use a simple, 
sector-specific model to show how the emission reductions from pro- 
ducer subsidy reform could be more material than Jewell et al. suggest?. 
Fossil fuel producer subsidies delay a low-carbon transition in ways 
both material and political, and they deserve greater attention and 
transparency in global modelling analyses, as well as in policy-making. 

The study by Jewell et al.? provides important findings related to fossil 
fuel subsidy removal. Using a synthesis of five Integrated Assessment Mod- 
els (IAMs), they find that subsidy removal could reduce global emissions 
by 0.5 to 2 gigatonnes (Gt) of carbon dioxide (CO,) by 20307. Jewell et al. 
characterize these global emission reductions as “unexpectedly small”, 
while noting that they would largely occur within a few energy-exporting 
countries and regions (Russia, the Middle East and Latin America). 

We argue that the emissions reductions from subsidy removal are 
not small. By contrast, 0.5-2 Gt CO, amounts to roughly one quarter of 
the energy-related emission reductions pledged by all countries under 
the Paris Agreement (4-8 Gt CO,), all froma single policy approach that 
also comes with strong fiscal and other environmental benefits‘. This 
scale of emission reductions should not necessarily be surprising or 
unexpected: few policy analysts hope that any single instrument can 
deliver reductions at the scale needed to meet climate goals. 

Moreover, we argue that the impact of subsidy removal on emissions 
is likely to be more substantial than Jewell et al. find’, particularly when 
considering support for fossil fuel producers in high-income countries. 
Although their approach uses common IAM techniques, it does not 
adequately capture investment dynamics in the supply of new fossil 
fuels, and therefore misses a major pathway for subsidy reform to affect 
CO, emissions. Specifically, their approach does not consider how the 
timing of producer subsidies (concentrated early inan investment life- 
time) and the higher effective discount rates of investors (as compared 
with society) affect investment decisions to bring on newsupplies of oil. 

Oil provides more of the world’s energy than any other fuel, and 
exploration and development of supplies remain robust°. The model 
in ref.” of producer subsidies to oil distributes regional subsidy totals 
equally to all oil fields—both new and already-producing fields—in each 
region, proportionate to annual output. However, that is often not 
how subsidies to oil producers work. Instead, governments frequently 
target subsidies more towards new capital investment than ongoing 
production. By lowering upfront cash flow requirements, government 
subsidies boost project investment metrics (suchas rate of return or net 
present value), which leads producers to drill more new wells than they 


would otherwise. This locks in higher future fossil fuel production and 
thus also higher future consumption and greenhouse gas emissions°®. 

Using the example of one type of subsidy for investment—acceler- 
ated depreciation of new capital investment—we illustrate how oil 
subsidies could have a bigger effect on global CO, emissions than in 
Jewell et als analysis”. This particular form of support, exemplified by 
the intangible drilling cost (IDC) subsidy in the United States, allows 
companies to quickly write down capital investments that would oth- 
erwise depreciate more gradually, providing a boost to cash flow at 
the beginning of a project. 

The IDC subsidy is underappreciated in Jewell et al’s analysis” because 
they value it only at the reported value of about US$0.20 per barrel (all 
dollar prices herein refer to 2016)’. This reflects the reduction in cash 
flow to the United States Treasury that results from the delay in annual 
tax payments. But whereas the USA government may be almost indiffer- 
ent whether it receives tax revenues this year or the next, oil company 
investors are not, because they can use that cash flow to accelerate 
new investment. 

If the IDC were valued not ona nominal cash basis but instead ona 
present value basis, using investor discount rates of 10% to 20%, the 
subsidy would make it substantially easier to invest in new oil fields, 
decreasing the breakeven oil price of new projects by US$4 to $7 per 
barrel (Table 1). 

Changes in breakeven economics of this scale could have a substan- 
tial effect on global oil market price dynamics and consumption. This 
would especially be the case if subsidy removal were to render uneco- 
nomic many of the new projects on course to be developed before 
2030. This outcome could well arise, since the USA has a substantial 
fraction (more than 40%) of the new oil projects that can be produced 
by 2030 (Extended Data Fig. 1). Other producers with substantial new 
supplies planned, such as Canada’ and Norway’, also offer accelerated 
depreciation of new oil capital investments. 

Table 1 estimates how the global oil market may respond to removal 
of the accelerated depreciation subsidies, based ona simple oil market 
model (see Methods). As shown, in the low-oil-price world featured 
by Jewell et al.”, the effect of removing the depreciation subsidy to 
producers could reduce global oil consumption by 440 to 770 million 
barrels in 2030. 

Yet the previous analysis by Jewell et al. includes only a very small 
fraction of this effect. They do not report this result, but we estimate 
it to be roughly 21 million barrels (Table 1, column A). 

We therefore believe that, in their low oil price case, Jewell et al. 
missed a reduction in global CO, emissions from oil combustion on 
the order of 200 to 300 million tons CO, that could result from the 
removal of a single type of subsidy common in the USA and other oil- 
producing countries. 

The actual outcome on net global CO, emissions from all fuels is likely 
to be somewhat lower because coal or gas might substitute for some 
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Table 1| Removing subsidies that accelerate write-down of capital investment reduces global oil consumption 


(A) Subsidy valued on cash basis, Subsidy valued on present value basis at given investor discount rates 


as in United States government 
source used by OECD and ref.” 


(B) Rate commonin 
academic literature 


(C) Rate common in 
industry studies 


(D) Higher-risk rate (if weakened investor 
climate or higher-risk fields)"° 


10% 15% 20% 
Effect of subsidy on economics of new oil projects 
Effect on projects’ breakeven price 0.20 4.20 5.80 7.30 
(US$ per barrel) 
Market effects of subsidy removal for high-oil-price case in 2030 
Increase in global oil price 0.07 1.40 1.90 2.40 
(US$ per barrel) 
Decrease in globaloilconsumption 4 76 110 130 
(millions of barrels) 
Decrease in globalCO, emissions 1 30 42 52 
from oil (millions of tonnes of CO,) 
Market effects of subsidy removal for low-oil-price case in 2030 
Increase in global oil price 0.13 2.80 3.90 4.90 
(US$ per barrel) 
Decrease in globaloilconsumption 21 440 620 710 
(millions of barrels) 
Decrease in globalCO, emissions 8 180 250 310 


from oil (millions of tonnes of CO,) 


These estimates of the effect of subsidies on projects’ breakeven prices (first row) are calculated on a present value basis, as the production-weighted averages across nearly 800 discovered oil 
fields in the United States (see Methods). By contrast, Jewell et al.? value the fast depreciation subsidy only ona cash basis, spread across all fields; while they do not report the value of this sub- 
sidy in their analysis, we estimate it from the same primary sources they used to be about US$ 0.20 per barrel (column A), as described further in the Methods. We estimate the market effects of 
removing these subsidies using a simple oil market model, at three different investor discount rates (columns B to D), all of which are on a nominal basis (no deduction for inflation). We assume 
that not-yet-developed USA oil projects are higher up the oil cost curve in 2030 (as is oil from other countries that also have a corresponding accelerated depreciation subsidy, like Canada 

or Norway), such that increases in the breakeven prices of these fields could well have a direct effect on long-term prices and consumption levels. We also assume here that subsidy removal 
begins immediately (in 2019), whereas Jewell et al. assume subsidy phase-out starts in 2020 and is completed in 2030. However, producer subsidy removal is not subject to the same concerns 
as consumer subsidy removal—namely equity and locked-in consumer behaviour—and thus would not need to be phased in so gradually. 


of the lost oil consumption, though concurrent removal of subsidies 
for these fuels would minimize this effect. IAM models, like those used 
by Jewell et al.”, are well suited to evaluating these interactions. Yet the 
scale on which CO, emissions from oil have potentially been underes- 
timated—equivalent to 10% to 60% of the reported global effect” due 
to removal of all subsidies (0.5 to 2 Gt CO, in 2030)—suggests that oil 
producer subsidies deserve greater attention and transparency in 
global modelling analyses. 

The investment-oriented approach to modelling subsidies used 
here and the broader, average cost-curve approach of ref. ? are not 
incompatible. Fossil fuel supply in IAMs could be modelled using an 
investment approach and vintage capital structure, as is often applied 
to power plants that have upfront costs and default lifetimes”. Insuch 
an approach, new oil deposits would also be modelled as prospective 
investments, as demonstrated here, using realistic discount rates of 
10% to 20% that are common in the oil industry". 

In fact, subsidies may have an even more important role than wecan 
quantify here. Extra company revenue resulting from subsidies can be 
used not only for more drilling, but also for product promotion, politi- 
cal activities and other efforts that fortify the industry’s incumbent 
status. Subsidies also have a symbolic effect, in that they communicate 
the normative position that this industry and its activities are beneficial 
for society asa whole and, therefore, should be encouraged. Jewell et al.” 
disregard these socio-political effects when downplaying the value of 
removing fossil fuel subsidies. 

The economic, political and symbolic effects of subsidies reinforce 
each other”. For example, subsidies can beget more subsidies, with 
new, long-lived fossil fuel infrastructure in turn (1) requiring further 
subsidization down the line to continue operating”, and (2) yielding 
beneficiaries who will vigorously defend continued subsidization’’. Since 
there can bea revolving door between government staff and subsidy 
recipients, public officials may find it even harder to pass strong climate 
and energy policies’. Indeed, the most troubling impact and legacy of 
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fossil fuel subsidies may be the political barriers that fossil fuel produc- 
ers have erected in recent decades against decarbonization efforts’”"’. 

Rapid low-carbon transitions consistent with the guardrails of the Paris 
Agreement require dramatically reduced fossil fuel production”. Subsi- 
dies to fossil fuel companies pose formidable financial, institutional and 
political obstacles to this transition, impeding the efficacy of greenhouse 
gas emission reduction strategies. The apparent small dollar values of 
producer subsidies in official, government-approved ledgers, and the 
limited emissions impact suggested by global models such as those used 
by Jewell et al.2, can be misleading. The actual impacts, particularly when 
one considers their social and political effects, are far greater. 

Methods areinthe Supplementary Information to this Matters Arising 
Comment. 


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The authors declare that data supporting the calculations in columns B 
throughto D of Table 1 areincluded as Supplementary Information. The 
raw data analysed by the authors for Extended Data Fig. 1 are available 
from Rystad Energy in their UCube database, but restrictions apply tothe 
availability of these data, which were used under license for the referenced 
study, and so are not publicly available. Raw data are available from the 
authors upon reasonable request and with permission of Rystad Energy. 


Code availability 

No custom code or mathematical algorithms were used to generate 
results reported in this paper. The entirety of the oil market model is 
provided as equation (1) in the Methods. 
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Extended Data Fig. 1| Cost curve of world oil production in 2030. The Further (16) blocks represent the USA or Canada, since they are major new 
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In 2009, the G20 countries pledged to phase out fossil fuel subsidies’. 
Our original Letter highlighted that about 95% of subsidies go to con- 
sumers and two-thirds are in the Middle East, Russia and Latin America’. 
We also found the largest emission reductions from subsidy removal 
would occur in those three regions, where low oil prices provided a 
unique political opportunity and the reforms would harm fewer poor 
people. In the accompanying Comment’, Erickson et al. argue that 
we downplay the impact of subsidy removal and the effect of subsi- 
dies for oil producers, such as the USA's intangible drilling cost (IDC) 
scheme. Here we show large variations in such schemes and estimate 
their impact to be within the range of the sensitivity analysis from our 
original article. The USA IDC may represent a unique political oppor- 
tunity for producer subsidy reform, but reforming such schemes may 
be counterproductive in countries where they are applied in tandem 
with high taxes for oil production. 

We estimated that emission reductions from subsidy removal would 
be between 2-8% and 3-15% of those required by 2030 to achieve the 
1.5 °C and 2 °C targets. We called this “unexpectedly small” because 
these estimates contrast with sweeping statements that subsidy 
removal would have “significant” effects and is “the missing link in 
the fight against climate change”. Yet we agree with Erickson et al.? 
that given the immensity of the climate challenge, these numbers are 
notable and are certainly not an argument against subsidy reform. 

Erickson et al.’ estimate the size and effect of the USA’s IDC scheme, 
which allows accelerated depreciation of drilling costs, essentially tax 
deferrals for oil producers. Their approach is different from ours in how 
subsidies are defined and measured. In our original Letter”, we used 
government inventories* * for our central estimate, because these are 
the very subsidies that governments have pledged to remove. Erickson 
et al.’ consider any regulation that makes oil production more profit- 
able to be a subsidy even if it does not involve net transfers from the 
government. This leads them to use data not from government inven- 
tories of subsidies but from analysing oil production economics. Thus, 
Erickson et al.’ analyse the hypothetical cash flow for 800 oil fields in 
the USA and calculate the effect of the IDC scheme on the breakeven 
price of individual projects—we call this the ‘effective subsidy rate’. 
They then assess the global impact of similar schemes by assuming 


all oil producers worldwide benefit from the same effective subsidy 
rate as inthe USA. 

Global IAMs can greatly benefit from such data if they are param- 
eterized for long-term global scenarios. The first set of parameters 
defines how accelerated depreciation affects the effective subsidy 
rate. This depends on a project’s breakeven price, discount rate, 
share of capital costs, the national tax regime, and the design of the 
accelerated depreciation scheme, all of which vary widely across 
countries and over time (Methods). To determine whether Erickson 
et al.'s results? would affect our original findings’, we developed a 
discounted cash flow model to analyse the effective subsidy rate for 
the USA IDC and from accelerated depreciation schemes for three 
additional countries with diverse institutional arrangements and 
geographies (Methods). 

In the case of the USA, our model provides results similar to those 
of Erickson et al.’ for the 2016 case, but the 2017 tax cut reduced the 
effective subsidy rate by about half and the recent fall in the cost of 
NorthAmerican tight oil reduced it by another 30% (Table 1, Methods). 
The effective subsidy rates from accelerated depreciation schemes 
in Canada, Norway and Russia under a range of plausible breakeven 
prices are between two and ten times smaller than the USA 2016 case. 
Using this range, we estimate the global effective subsidy rate from 
accelerated depreciation schemes to be US$0.3-1.9 per barrel [using 
central assumptions, as described in the Methods, the value is US$1.0; 
central values are shown herein in square brackets] (US dollar prices 
herein refer to 2016; Table 1). 

The uncertainty in estimating production subsidies is well known®”. 
That is why, in our original Letter”, we included a sensitivity analysis 
using an alternative estimate of production subsidies (including an 
alternative calculation of the USA’s IDC scheme)”. Oil production sub- 
sidies in that analysis for the low-oil-price scenario were about fifteen 
times higher than those reported in government inventories”. The 
oil production subsidy rates in that original sensitivity analysis are 
also generally higher than the effective subsidy rates we estimate for 
accelerated depreciation schemes (column C of Table 1). 

The second step in the analysis by Erickson et al.’ is to estimate the 
effect of accelerated depreciation schemes on global oil consumption 
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Matters arising 


Table 1 | The effect of oil production subsidies on producer costs and global oil consumption 


IAM analysis of all producer subsidies in Jewell et al.” 


(low-oil-price scenario) 


Discounted cash flow model of accelerated depreciation schemes 
(10% discount rate except for final row; see Methods) 


(A) Main estimate of production 
subsidies from ref. ° 


(B) Higher production 
subsidies from refs. °°. 


(C) Variations in tax rates, capital cost, (D) Erickson et al.? 
accelerated depreciation schemes, 


breakeven prices and elasticities (our analysis) 


Effective production subsidy rate (US$ per barrel) 


USA 0.6 24 1.9 (2019 case) 4.2 (2016 case) 
4.9 (2016 case) 
Other regions Canada 1.1 Canada 1.5 Canada 0.5-1.4 [0.9] 
Europe 0.4 Europe 2.2 Norway 0.9-2.0 [1.5] 
Russia O Russia 5.2 Russia 0.9-2.1 [1.6] 
MENA O MENA 2.3 Saudi Arabia and Nigeria O 
Global 0.2 2.6 0.3-1.9 [1.0] 
Change in global oil extraction or consumption (millions of barrels per year) for the low-oil-price scenario 
Change in extraction due to higher production subsidy estimate 590 440 


Variation due to effective subsidy rate using elasticities in Erickson et al® and 10% discount 


rate 


30-200 [110] 


Variation due to elasticity assumptions” ? 


discount rate 


using central effective subsidy rate and 10% 


20-140 [90] 


Variation due to discount rates using central effective subsidy rate and elasticities in 


90-150 [130] 440-770 [620] 


Erickson et al.? In column (C), discount rates vary from 7.5% to 20% and in column (D) from 
10% to 20% (Methods). In both cases, the discount rate for the central estimate is 15%. 


Columns A and B contain estimates of all oil production subsidies from our original article?. In our sensitivity case, the subsidy rate and its effect on oil production is higher than under acceler- 
ated depreciation schemes (column C). For the USA, the results for the 2016 case are in italics including from Erickson et al. (column D). ‘Canada’ refers to the CAJAZ (Canada, Japan, Australia 
and New Zealand) region whose oil production is dominated by Canada (over 98%). ‘MENA refers to the Middle East and North Africa region. 


with asimple oil market model. Their calculation is sensitive to supply 
and demand elasticities that are highly uncertain (Methods). They 
use a Single value for demand elasticity and a single value for supply 
elasticity for each oil price. A range of supply and demand elasticities 
from previous studies that used the same simple oil market model”’” 
changes the results by almost by an order of magnitude even under 
the same effective subsidy rate (Table 1, Methods). 

In the sensitivity analysis from our original article, we estimated 
that a more than tenfold increase in oil production subsidies would 
increase oil extraction by 590 million barrels per year (Table 1). The 
higher production subsidies (including all production subsidies, not 
just oil) would increase emission reductions from subsidy removal by 
0.3 gigatonnes of carbon dioxide per year in 2030, which is about 13% 
higher than the main estimate of the model used for that sensitivity 
analysis, or about 1% of the emission reduction required by 2030 to 
achieve the 1.5 °C or 2 °C target (Methods). 

The final parameter affecting the effective subsidy rate is the discount 
rate, which Erickson et al.? assume varies between 10-20%. The upper 
end of this range is speculative because discount rates for the oil sector 
have generally varied between 9% and 11%” (Methods). Table 1 shows 
our results using a discount rate of 10%, but our conclusions are robust 
over the full range in Erickson et al.>: a20% discount rate increases the 
global effective subsidy rate to US$0.4-2.7 [1.4] per barrel (Methods). 

This exchange highlights the importance of improving IAM param- 
eters by incorporating new data. Such data are more meaningful to 
global long-term IAMs if it is clear whether and how they are applicable 
beyond a single country at a single point in time. The generalizability 
of such data can be improved if they extend to a wider and more repre- 
sentative sample””’, which IAMs canuse, as illustrated by the sensitivity 
analysis in our original article. Finally, these data should be up-to-date 
and transparent about uncertainties, including those arising from dif- 
ferences in policy environments. 

Although the effect of accelerated depreciation schemes can be 
incorporated into IAMs by adjusting the effective subsidy rate, we also 
agree with Erickson et al.’ that IAMs should better represent oil and gas 
infrastructure in the same way as they model the vintage structure of 
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the power sector”. Another promising avenue would be to depict oil 
and gas investments using a real options ‘wait and see’ approach” and 
to model price formation in the oil market” more realistically. These 
improvements may either dampen or amplify the effects of subsidies 
in IAMs, depending on whether infrastructural inertia, ‘wait and see’ 
behaviour, and strategic markets are more or less responsive to pro- 
ducer cost signals than in today’s IAMs. 

We also strongly agree with Erickson et al.* that the social and politi- 
calimpacts of subsidy removal should always be examined in tandem 
with their emission impacts. However, it is time for social scientists to 
go beyond listing various negative effects of subsidies which are well 
documented in the literature and clearly extend beyond economics” ”° 
and instead identify opportunities and pathways for reform. That is 
why, in our original article, we complemented energy and emissions 
analysis with a discussion of the socio-political impacts of subsidies 
toidentify a political opportunity for reform in oil- and gas-exporting 
countries under low oil prices where reducing consumption subsidies 
would affect fewer poor people, relieve squeezed government budgets 
and lead to the largest emission reductions. 

Alesson from our original article is that the environmental and socio- 
political impacts of and obstacles to consumer subsidy reform vary 
between countries. This is almost certainly the case for producer subsidies 
as well. Inthe USA, the original rationale (energy security and uncertainty 
inoil drilling) forthe IDCis outdated, and the scheme now doeslittle more 
than confer an unfair advantage ona polluting, privately owned and prof- 
itable industry. Reforming this scheme is complicated by the political 
clout of theindustry, but at least its public benefits and endpoint are clear. 

However, such subsidies are much more difficult to identify, much 
less reform, in countries like Norway and Russia where oil producers 
pay very high taxes—reaching over 70% on profits. These taxes are a 
major source of government revenue used to fund public services. 
Would reforming accelerated depreciation schemes in these contexts 
also mean tax reductions for the industry? Would the endpoint be to 
bring the oil industry in line with the rest of the economy, something 
clearly not desirable either socially or environmentally? And if not, 
what would be the goal and the strategy for reform? 


Generalizing insights from the USA to the whole world is mislead- 
ing bothinterms of science and policy. Finding effective strategies to 
meet the Paris Agreement requires a detailed understanding of how 
oil production and other carbon-intensive sectors are embedded in 
national socio-political and economic contexts. 


Reporting summary 


Further information on research design is available in the Nature 
Research Reporting Summary linked to this paper. 


Data availability 


The authors declare that the data supporting the calculations are avail- 
able in the Methods or from publicly available sources cited in the 
Methods. 


Code availability 


Nocustom code or algorithms were developed for the discounted cash 
flow results reported in this paper. 
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OUT-OF-OFFICE REPLIES A 
WHAT THEY SAY ABOUT YOU 


An automated e-mail response posted on Twitter unleashed a social-media 
debate about the importance of work-life balance. By Stephana Cherak 


ne weekend last October, I received 
an out-of-office reply from an aca- 
demic faculty member: “I do not 
respond to e-mails on weekends. If 
this is an emergency, please call my 
mobile. If you do not have my mobile number, 
then you do not have a weekend emergency.” 
The tone, | felt, was telling — researchers are 
often under pressure to keep up with immense 
workloads and to stay on top of the changing 
world of science, while trying to protect their 


life outside work from being invaded by faux 
emergencies or apparently urgent e-mails. 
Indeed, inresponse to Nature's 2019 biennial 


“Thestruggle to balance 
work and life is a particular 
challenge for graduate 
students.” 


© 2020 Springer Nature Limited. All rights reserved. 


PhD survey, only 37% of PhD students agreed 
that their institute supported a good work-life 
balance. Thirty-four per cent felt it failed todo 
so, and nearly 40% of respondents said they 
were unsatisfied with their work-life balance. 

Researchers, including PhD students, 
senior scientists and graduate students like 
myself, are not the only ones facing pressure 
to work harder and faster. One 2017 survey (see 
go.nature.com/2rxh4rb) of workers in Brit- 
ain, for example, found that nearly half said 
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Tales from 
Twitter 


When | tweeted an out-of-office message 
in October that read: “I do not respond 
to e-mails on weekends,” the post went 
viral. So far, it has received 41,200 likes 
and more than 4,600 re-tweets — 

nearly 3.1 million Twitter accounts have 
interacted with the post. I've curated a 
selection of my favourite replies so far. 


Lives aren't controlled by e-mail 

“| had a discussion with a busy academic 
recently about an international trip 

where she was unexpectedly unable to 
access e-mail and it was so backlogged 
by her return that she just ignored/ 
deleted all unopened mail. And then 
marvelled that the world reallydidn’t stop.” 
@JessicaRenee_83 


“When | took parental leave recently, | 
didn’t respond to e-mails for almost an 
entire month. The world didn’t end. I'd 
gladly do it again.” @tomkXY 


Honouring boundaries and timelines 

“| haven't quite reached this level of 
firmness in my boundaries, but | do know 
that my life has gotten much better 
since | decided that | don’t need ‘fastest/ 
best/most consistent e-mail responder’ 
to be part of my professional legacy.” 
@popmediaprof 


“| used one of these responders for years as 


a prof. Now | just take my sweet time to reply 


to e-mails and people learn. Their timelines 
aren't my timelines.” @DoctorLindy 


Do as | say, not as! do 

“Fully support this. | also recommend 
adding a statement to any e-mails sent 
out of regular business hours to the effect 
of ‘this email is being sent out of normal 
business hours and | don’t expect an 


an 


immediate reply’.” @runforbooze 


“On weekends | respond to e-mails on my 
own timeline. Sometimes that means right 
away, but most times it means not until 
Monday. If a student reads too much into 
a quick reply and comes to expect that, 

| simply say that past performance is no 
guarantee of future results.” @Meteodan 


And the winner is... 


“There is no such thing as an academic 
emergency.” @JLasaine 
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their jobs require them to work very hard — 
compared with less than one-third in 1992 (see 
‘Off balance’). When I tweeted the automated 
response in October, it went viral, suggesting 
work-life balance is an issue throughout the 
working world, notjust in academia (see ‘Tales 
from Twitter’). 

Forgoing work-life balance in order to be as 
productive as possible is tempting in academia, 
because those who are the most productive are 
also the most rewarded. Although hard work 
might be valued, it’s hardly the only factor in 
a successful career — numerous studies have 
shown that as we work more, the quality of 
what we do decreases. The scientific commu- 
nity should learn to value quality over quantity. 

The struggle to balance work and life is a par- 
ticular challenge for graduate students. They 
often find the line between the two becomes 
blurred with efforts to seem productive and 
important. Institutions have a responsibility 
to students and employees to protect their 
health; often this means helping people to 
manage their work-life balance. But the cul- 
ture at science institutes can tend towards long 
working hours. As Meghan Duffy, an ecologist 
at the University of Michigan in Ann Arbor, told 
Nature in 2017: “The idea that you have to put 
in long hours is pervasive. If you’re not work- 
ing 60 or 80 hours a week, you're not doing 
enough. It makes people insecure.” 

After receiving hundreds of comments in 
response to my tweet about the efforts people 
go to to maintain balance, I was prompted to 
reflect on my own techniques. Here are a few 
of the principles that I’ve adopted as I strive to 
balance work and other aspects of life: 


Don’trush, set your own pace. Graduate stu- 
dents often want immediate results. thought 
the more courses I took or the more experi- 
ments! planned in one week, the earlier 1 might 
graduate. During my undergraduate degree! 
started arriving at university at 6 a.m. and leav- 
ing at 8 p.m. But, in graduate school, | soon 
learnt my studies were nota sprint, but a mar- 
athon. Make that marathon work for you. Dur- 
ing my master’s degree! was also competing at 
ahigh level in athletics; this meant my course- 
work needed to be restructured so 1 could be 
successful at both. In every department there 
are resources to help you to personalize your 
programme. Take advantage of these oppor- 
tunities, and don’t feel the need to align with 
the status quo — youccan save yourself a lot of 
sleepless nights by planning strategically. 


Ask for support. Academics often think the 
work they do should be perfect. This causes 
trouble: graduate students often neglect their 
lives outside work to strive for perfection. But 
as astudent, you are expected to ask for help, 
assistance and guidance. Finding the confi- 
dence to ask for support from supervisors 
and mentors is beneficial and necessary for 
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OFF BALANCE 


Nearly half of UK workers across all sectors say 
their job requires them to work very hard. 
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growth. Academic studies can mean working 
alone, but it is important to remember that 
one of the most rewarding aspects of being a 
researcher is collaborating with others. Many 
would say that you are only as strong as those 
you surround yourself with. 


Define your own personal balance. Graduate 
students often spread themselves too thinly, 
agreeing to spend longer hours inthe lab orin 


“I thought the more 
courses I took or the more 
experiments I planned, the 
earlier I might graduate.” 


front of a laptop at the expense of their time 
outside work. Although there are certain 
non-negotiable expectations and levels of pro- 
fessionalism required for a graduate degree, 
attaining these shouldn’t come at the cost of 
your health and well-being. Dedicate time to 
defining and finding your personal balance, 
and honour this throughout your career. 


Always be kind. Academia has rough edges 
and the way that researchers communicate 
can often seem curt or unpleasant for some- 
one who isn’t used to it. Seek to pursue your 
own acts of academic kindness. Perhaps thank 
colleagues more for their hard work — or 
even send acard. This might inspire others to 
shift their own behaviour and perhaps to be 
a bit more generous and compassionate. By 
being kind ourselves, we might help to make 
academia, as a whole, just a little bit kinder. 


Stephana Cherak is a PhD student in the 
Department of Community Health Sciences, 
University of Calgary, Canada. Her PhD 
research focuses on patient and family- 
centred critical care. 
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TECH TOOLS TO MAKE RESEARCH 
MORE OPEN AND INCLUSIVE 


Laboratory heads are deploying apps and software in innovative ways 
to build broad and diverse research groups. By Kendall Powell 


oris Taylor knows the sting of being set 

apart as different. Asa young, lesbian 

woman starting her career in regen- 

erative-medicine research in the late 

1980s, she was often excluded from 
faculty functions and private meetings onthe 
golf course. “You want to be differentiated 
when doing great science, but not because of 
who you are,’ she says. 

Galvanized by her experiences, Taylor has 
built a laboratory group at the Texas Heart 
Institute in Houston that strives to be diverse 
and culturally sensitive. She knew she had 
come close when she overheard an undergrad- 
uate researcher telling his mother about alab 
birthday celebration: “I was the only white guy 
there — it was great!” 

Taylor thought carefully about how best to 


build a diverse, inclusive and equitable team, 
representing a range of perspectives and back- 
grounds, and like many other investigators 
who value such things, she has increasingly 
relied on technology to advance those goals. 

Group leaders say that these tools can help 
to flatten power differentials between lab 
members and keep people connected and 
communicating oncommon, andimportantly, 
even ground. The tools are familiar, and even 
ubiquitous — Slack, Skype and WhatsApp 
(Taylor’s tool of choice), for example. But 
when deployed strategically, these apps 
can promote a more level playing field to 
benefit colleagues from disadvantaged and 
under-represented backgrounds, those with 
disabilities, or those who might work and think 
differently. 


© 2020 Springer Nature Limited. All rights reserved 


That’s not to say technology isa silver bullet 
— building an inclusive environment requires 
asustained commitment from lab leaders and 
members, on multiple levels and using many 
techniques. And no amount of technology 
can erase bullying, discrimination and other 
bad behaviours from the workplace. But these 
tools are helping many inclusive-minded group 
leaders to transform research from an isolated 
pursuit into a more open, collective exercise. 

“Any technology that increases commu- 
nication in a way that is non-threatening, is 
beneficial,’ says Taylor. 


Messaging equality 

The University of Helsinki’s Computational 
Field Theory Group, which studies what hap- 
penedinthe10 picoseconds after the Big Bang, 
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for instance, uses an open-source messaging 
platform similar to Slack to share data and dis- 
cuss results with collaborators. But the group 
takes the tool, called Mattermost, even further: 
it uses it as a forum for nearly all group commu- 
nications, from discussing research projects to 
organizing spontaneous outings and lunches. 
This keeps discussions open and transparent 
toall ofthe group’s 20 or so members and their 
colleagues. Members frequently add notes 
from face-to-face conversations as atranscript 
record and to keep everyone in the know. 

David Weir, a physicist inthe group, explains 
that workplace surveys had revealed internal 
communication problems in the Helsinki 
physics department, and showed that mem- 
bers, particularly women who didn’t speak 
Finnish fluently, often felt isolated. “I do think 
[Mattermost] helps lower the threshold to 
people participating,” he says. 

Saga Sappi, a PhD student ina neighbouring 
theoretical-physics group, says the open-to-all 
messaging has madea “night and day” positive 
difference to socialinteractions. And it lowered 
the barrier to getting help, she adds, by making 
it easier to send research questions informally to 
the entire group rather than having atime-con- 
suming e-mail exchange with a supervisor. 


Terms of engagement 


Other tools can also ease communication and 
lower barriers. Juan Gilbert’s computer and 
information-science group at the University 
of Floridain Gainesville, for instance, uses the 
videoconferencing program Zoom to support 
lab members during pregnancy and parental 
leave. Zoom allows them to join lab meetings 
or have consultations when at home — but only 
ifthey choose to. “They want to stay engaged, 
and using Zoom keeps them connected on 
their own terms,’ says Gilbert. 

Brenna Hassett, a physical anthropologist 
at University College London, says such open, 
transparent group communications can provide 
ahealthy counterbalance tothe power dynam- 
ics that naturally exist ingroup meetings led bya 
principalinvestigator or aclosed-door meeting 
between a supervisor and their student. 

When everyone can weigh in on a conversa- 
tion, she says, it helps to guard against misun- 
derstandings or misread cues and adds broader 
context. “You might leave a closed-door meet- 
ing thinking your PhD supervisor hates you, 
when in fact, they just had a reasonable criti- 
cism about your bibliography,’ says Hassett. 

Another advantage to all-group communi- 
cations apps is that they make it harder to say 
“anything even remotely inappropriate”, adds 
Hassett, who was co-organizer of asessionon 
tech tools for gender inclusion at the 2019 
Science Foo Camp conference, held last July 
in Mountain View, California, and supported 
by Nature Research, part of Springer Nature 
(the publisher of Nature). 

But there are downsides, warns computer 
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scientist Kate Devlin at King’s College London, 
who co-organized the session with Hassett. 
“Iwonder how many brakes are put on conver- 
sations because of the transparency?” 


Research co-op 


Another way of flattening group hierarchies 
is to make research both open and collective. 
Marine-data scientist Julia Stewart Lowndes 
is an advocate of the open-science movement, 
which espouses open-source software, data 
sharing and transparency in data analysis 
and publishing. “But you need ateam culture 
and welcoming environment for people to 
feel safe” about sharing and discussing data 
freely, says Stewart Lowndes, who works at 
the National Center for Ecological Analysis and 
Synthesis in Santa Barbara, California. 

One way to build this trust, she says, is to 
make codes of conduct or lab values public 
so that everyone has shared expectations. For 
instance, one such document from an event 
that Devlin helped to run states: “We believe 
that everyone has the right to be ina safe and 
welcoming environment.” The information 


“Any technology that 
increases communication 
in away that is non- 
threatening, is beneficial.” 


might also help to recruit scientists from more 
diverse backgrounds, if they see that a group 
welcomes different perspectives and has values 
that align with their own. (See ‘Low-tech tips for 
inclusivity’ online at go.nature.com/2gdubnt.) 

In Stefania Milan’s team, doing science col- 
lectively isn’t just an attitude, it’s an operat- 
ing principle. Her group at the University of 
Amsterdam studies the evolution of political 
activism in the age of big data, and each of the 
dozen members has an equal say in both group 
and research decisions. 

After receiving a five-year grant to start 
her group, she and the team spent about 
18 months crowdsourcing and developing a 
set of lab values by which they operate, along 
witha research questionnaire and the technol- 
ogy infrastructure needed to conduct their 
research securely. Milan could have completed 
that work more quickly on her own, she says, 
but her international and cross-disciplinary 
team helped to forge a stronger toolkit. 

The lab members pool their data and do 
team analyses using open-source software, 
including a custom code-sharing system 
similar to GitHub, which is paired with the 
ownCloud cloud-storage service. This infra- 
structure allows them to write and code 
collaboratively and to share calendars and 
documents while storing the data ona private, 
protected server. The whole system is accessi- 
ble by team members who live abroad or work 
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from home, so everyone can joininthe team’s 
weekly coding session. 


Supporting learning differences 


For regenerative pharmacologist Sara Rankin, 
inclusivity means accommodating a neuro- 
diverse population. Rankin found out late in 
life that she has dyslexia and dyspraxia: learn- 
ing differences in the way her brain processes 
written words and organization. “People work 
and think in different ways and you've got to 
allow them to do that,” she says. 

Rankin’s university, Imperial College London, 
invests in a suite of 16 inclusive software pro- 
grams to help students and staff who have learn- 
ing differences or for whom English is a second 
language. It includes programs such as Gram- 
marly, to check spelling and grammar, as well as 
toolsto help researchers to craft writing or talks 
in non-conventional ways. Audio Notetaker, 
for instance, records audio during lectures and 
syncs it with typed notes, while the speech-to- 
text software package Dictation.io helps those 
for whom dictating papers or presentation 
slides comes more easily. 

Rankin uses the idea-mapping program 
MindViewtosee her notes on methods together 
with data charts, images and literature asso- 
ciated with a project — all on one screen. “Ata 
single click, that can be converted into a Word 
document” as a rough draft of your paper, she 
says. “That’s amazing if you area visual learner.” 

Gilbert suggests that leaders take their cues 
from their team when adopting tech tools such 
as Slack and WhatsApp. He says that many 
younger researchers view e-mail as formal and 
cumbersome. “They are already using these 
apps in their daily lives, so I wanted to incor- 
porate my lab into that,” he says. “And I geta 
better, more productive workforce that way.” 

That said, no matter how inclusive an envi- 
ronment might be, there is always room for 
improvement. Simple technologies can provide 
anonymous mechanisms for making com- 
plaints, reporting inappropriate behaviours 
or asking questions without the fear of bias or 
retaliation. For neurobiologist Leslie Vosshall at 
the Rockefeller University in New York City, an 
anonymous lab survey proved transformative, 
she says. The responses prompted Vosshall to 
make lab meetings more focused and journal 
club meetings more interactive, and revealed 
that there was uneven access to lab resources — 
a problem that was easily solved with a shared 
online folder for lab protocols. 

Embracing that spirit of sharing solutions, 
resources and power can go along way towards 
transforming laboratories into welcoming, fair 
workplaces. “Science is a social process. We 
do it in teams and we doit best when we area 
diverse, respectful team who care about each 
other,” says Weir. “It’s also more fun that way.” 


Kendall Powell is a freelance writer in Boulder, 
Colorado. 
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san evolutionary biologist, my 
main interest is in understanding 
how birds and bacteria evolve 

when they encounter unfamiliar 
environments. For the past 22 years, 
Ihave studied the dark-eyed junco (Junco 
hyemalis), a sparrow with white outer tail 
feathers. I’m especially interested ina 
population that has settled at the campus 

of the University of California, Los Angeles 
(UCLA). These birds had migrated 

70 kilometres from the Angeles National 
Forest or the woodlands of California’s 
Santa Monica Mountains. 

Most of my work is done here on the 
campus of UCLA, where 1 am in this picture. 
also go with my students into the Californian 
mountains to see howjuncos’ appearance 
and behaviour vary between their natural 
and their city habitats. There is something 
so joyful, so wondrous, about going into the 
on-campus ‘field’ to study birds — sometimes 
I feel | know alittle secret about the natural 
world, right here. It makes my heart sing. 

Juncos are unusual in that, even in urban 
settings, they build their nests on the 
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ground. When | see small shrubs or dense 
ground cover in some place where people 
won't tread, I always think, ‘Oh, that'd bea 
good place to put a nest. If seea tall tree 
with branches jutting out, I think that’d be 
a good place for ajunco to sing. 

When this photo was taken, my students 
and I had put up a ‘mist net’ of fine polyester 
mesh to capture ajunco, and we were trying 
to lure it by loudly playing recordings 
of other juncos advertising themselves, 
attracting mates or defending their territory. 
In my research, I’ve found that city juncos are 
less aggressive and fearful of humans than 
are mountain juncos. They also have shorter 
tails and wings, and less white in their tail 
feathers, and they breed more often. 

It’s amazing that I can walk from one 
building to another and see our birds. It’s 
a reminder that nature isn’t something we 
go to — it’s where we are. For me, that’s 
very inspiring. 


Pamela Yeh is an evolutionary biologist at 
the University of California, Los Angeles. 
Interview by Josie Glausiusz. 
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People at a social-welfare centre interact with the service robot ‘A Tie’ in Hangzhou, China. 
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Researchers are scrambling to meet the demands posed 


by arapidly ageing population. By Sarah O'Meara 


hen computational biologist 

Jing-Dong Han moved back to 

China from the United States 

in 2005, she found it difficult to 

convince other scientists there 
that her research goals were serious. 

Han wanted to focus on the science of 
ageing, an area that did not yet have its own 
funding stream in China. “Many of my peers 
told me this was not a real scientific research 
field,” she says. 

Han’s background in biomedical science, 
combined with her work asa software engineer 


and in computational biology, made her 
confident that she could prove them wrong. 
She wanted to deploy the same cellular 
approaches that she had used in the United 
States to investigate cancer to probe how the 
body ages. 

“I hoped this information could potentially 
be used to help people better understand and 
manage their health needs as they grow older,” 
she says. 

China’s population is getting old. A decline 
in the birth rate between 1980 and 2015 
as a result of the nation’s one-child policy, 
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combined with increased longevity, will see 
the proportion of people aged 65 and older 
triple between 2006 and 2050, to reach almost 
one-quarter of the population. 

The consequences of such a sharp 
demographic change will be substantial. One- 
fifth of the population already has neurode- 
generative or neuropsychiatric disorders, and 
the number with age-associated diseases such 
as stroke, Alzheimer’s, Parkinson’s and lung 
cancer will rise (see ‘Age-associated diseases’). 
By 2030, the number of citizens with at least 
one long-term disorder is expected to be three 
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times higher than it is today (see go.nature. 
com/3dd66bw). 

For Han, the realization among policymakers 
in the time since she’s moved back to China 
that the country’s health-care system needs 
to change to cope with the unprecedented 
demand from elderly citizens has provided 
momentum for her work. In around 2015, 
funding agencies in China started to enlist 
scientists to discuss and propose strategies 
to address the challenges from population age- 
ing, she says, and that has led to the initiation 
of dedicated grant tracks and an ever-growing 
ageing-research community. 

Han spent the first five years back in China 
exploring how genes interact during the age- 
ing process, before being asked in 2010 tolead 
a pilot project in Shanghai. The Partner Insti- 
tute for Computational Biology, a collabora- 
tion between the Chinese Academy of Sciences 
(CAS) and the German Max Planck Society, 
uses computational methods to extract pat- 
terns from huge gene and protein data sets 
to better understand how our cells and body 
age, among other goals. 

In 2015, after publishing numerous stud- 
ies about ageing in worms and mice, Han 
and her team published their first work in 
humans, based on a relatively small cohort 
of 300 people. It showed how algorithms 
can use 3D facial images to predict how fast 
a person is ageing (W. Chen et al. Cell Res. 
25, 574-587; 2015). She also secured a grant 
of 1.9 million yuan (US$270,000) from the 
Ministry of Science and Technology (MOST). 
Then, last July, she moved to Peking Univer- 
sity in Beijing, and now hopes to publish work 
on 5,000 individuals. 

Ageing research is still very much an 
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up-and-coming field, says Han, but “even if 
you can’t provide startling results right now 
or immediately describe how it will be applied, 
ageing-related research needs to be done 
urgently in the social context of China”. 


Anew field for old problems 


Between 2008 and 2018, the number of papers 
on geriatrics and gerontology published by 
researchers at Chinese institutions tripled. 

In 2016, the Chinese government 
announced the Healthy China 2030 plan, 
a national policy to address the country’s 
long-term health-care challenges. The policy 
includes a range of interventions to promote 
health in elderly people, such as encourag- 
ing a balanced diet and better fitness, and 
discouraging smoking. 

In the same year, the director of the CAS 


Institute of Neuroscience, Mu-ming Poo, out- 
lined details of the country’s most ambitious 
neuroscience programme, the China Brain 
Project, whose remit includes improving our 
understanding of autism spectrum disorder, 
depression and neurodegeneration (M.-m. Poo 
et al. Neuron 92, 591-596; 2016). 

Then in 2017, the National Natural Science 
Foundation of China (NSFC) announced a 
research project with dedicated funding 
called Organ Aging and Degeneration Mecha- 
nisms, through which Han received 2.4 million 
yuan the following year. And in 2018, MOST 
launcheda programme knownas Active Health 
and Technology Against Aging. 

All these efforts are designed to push the 
development of services and research for sen- 
ior citizens in China forwards, says Weiping Jia, 
whoruns the Shanghai Institute of Diabetes at 
Shanghai Jiao Tong University. 

The government has also funded centres for 
basic and clinical ageing research, including 
the CAS Institute of Stem Cell and Regenera- 
tion and the Aging Research Center at Peking 
University, both in Beijing. 

However, James Kirkland, head of the Robert 
and Arlene Kogod Center on Aging at the Mayo 
Clinic in Rochester, Minnesota, says that more 
commitment might be needed, given the size 
of the problem. 

“I think they’re ona knife-edge,” he says. “Is 
this going to be one of the scientific challenges 
they invest in heavily, or perhaps they'll focus 
on another research area, such as some form 
of newspace exploration? I don’t think they’ve 
made that key decision yet to push this kind of 
systems biology forward asa critical priority.” 

Geriatrics scientists in China are keen to 
tap into the country’s massive and homoge- 
neous population by using large-scale studies 
to generate reliable data on how people age. 


AGE-ASSOCIATED DISEASES 


As China's population ages, the country must treat more people with strokes, among other health issues. 
The proportion of the population aged over 65 is expected to reach 24% by 2050. 


In 1997, 6% of 
the population 
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By 2017, 10% of 
population was 


over 65. 


In 2007, 8% of 
population was 
over 65. 
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The nation has already prioritized the 
use of big data and artificial intelligence in 
medicine. Its Healthy China plan, for exam- 
ple, willinvolve building more data centres to 
collect and combine medical records that are 
currently spread across separate departments. 


National investment 


In 2018, MOST awarded 15 million yuan to 
Piu Chan, director of the Chinese National 
Clinical Research Center for Geriatric Dis- 
orders at Xuanwu Hospital, Capital Medical 
University, in Beijing. The grant was so that 
Chan, a neurologist who researches Parkin- 
son’s disease, could use patients’ electronic 
medical records to build a national database 
of information about neurodegenerative con- 
ditions. Given that nearly 60% of people with 
Parkinson’s live in China, that could become 
a valuable resource for researchers around 
the world. 

Chan is now working with biologist Gang 
Pei, who is at Tongji University in Shanghai 
and researches drugs that could reduce the 
effects of Alzheimer’s disease on cognitive 
decline, to get the database off the ground. 
It can be tough to convince different govern- 
ment offices to share information, Pei says, 
because there is little precedent for this kind 
of national collaboration. “We need to all 
agree onthe rules of how dataare shared, how 
they will be used, what form they comein and 
who will receive the benefit of any scientific 
achievement,” he says. 

Pei would also like to see China develop a 
repository for tissue samples that can be used 
by neuroscientists and clinical researchers 
throughout the nation, as well as worldwide, 
but says that’s likely to be stymied by the 
stigma attached to organ donation in the 
country. 

“We don’t have this tradition. We think the 
body is sacred. So we need to teach people 
why they should agree to donate their body 
to medical science for the greater good.” 

Guanghui Liu, astem-cell researcher at the 
CAS Institute of Stem Cell and Regeneration 
in Beijing and president of the Chinese Soci- 
ety of Aging Cell Research, says that as well 
as overcoming cultural challenges, research 
areas will need to be integrated better if the 
field is to make progress. “In China, we need 
cross-disciplinary integration of areas such as 
biomedicine, physics, chemistry, engineering, 
bioinformatics and artificial intelligence,’ he 
says. “That’s probably our greatest challenge 
to promote the development of the field.” 


Sarah O’Meara is a freelance journalist based 
in London. Additional research by Kevin 
Schoenmakers. 


Xiaoming Zhou 
Neurobiologist 


Xiaoming Zhou is a neurobiologist at 

East China Normal University in Shanghai. 
Here he speaks to Nature about his 
research into age-related hearing loss, and 
explains why he hopes that brain training 
could help to lessen declines in sensory 
perception generally, and so ward off 
neurodegenerative diseases. 


Whatis your current research focus? 
We want to better understand the 
neural basis for why a person's hearing 
function declines as they grow older. 
For example, we have performed 
research to see whether we can reverse 
age-related changes to the auditory 
systems of rodents. 

We gave the animals a set of tasks, such 
as learning to discriminate between sounds 
of different frequencies or intensities. These 
exercises caused the rodents’ hearing to 
improve, and also promoted changes to the 
hippocampus, a part of the brain structure 
closely associated with learning and 
memory. 

The relationship with the hippocampus 
suggests that new kinds of brain training 
might help to attenuate our declines in 
perception and other brain functions, such 
as learning and memory, as we grow older 
—and so have the potential to stave off 
neurodegenerative diseases. 


How is ageing-related science developing 
in China? 

As has happened in the rest of the world, 

a rapidly ageing population has brought 
significant concern to policymakers. 
However, as far as | know, only a few 
neuroscience laboratories in China are 
specifically focused on learning more about 
the underlying mechanisms that cause 
changes in brain function as we age. This 
is despite the fact that such research could 
have a considerable impact on the welfare 
of older people in the future. 

Nevertheless, the volume of research 
carried out in China in this area has 
increased dramatically — probably because 
of the huge growth of the country’s 
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scientific community as a whole in recent 
years, but also because funding for 
neuroscience research has risen. 


In what areas of ageing-related research is 
Chinese science making most progress? 
| think we could make the most progress 
in our research on Alzheimer’s disease, a 
neurodegenerative disorder that causes 
difficulties with memory, cognition and 
behaviour in many older people. It is one 
of three brain-related diseases that will 
be the focus of a forthcoming national 
neuroscience initiative called the China 
Brain Project. 

So it is foreseeable that a considerable 
number of Chinese scientists in related 
fields, such as neuroscience, medicine and 
artificial intelligence, will work together as 
part of this plan to study the mechanisms of 
Alzheimer’s. Of course, it’s hoped that these 
studies will receive huge financial support 
from the central government, and this 
should help to propel our research. 


What needs to happen for China to make 
greater leaps in scientific research? 

The environment for scientific research, 
including the management of staff, 
academic evaluation and financial support, 
needs to be further improved. For example, 
researchers spend a lot of valuable 

time doing administrative work, such as 
making applications for new instruments 
and organizing their expenses — time 

that would be better spent on scientific 
research, | think. 

Fortunately, the management team at my 
university has noticed this problem and has 
made a big effort to solve it, and the situation 
is now gradually improving. In the field of 
brain science, many Chinese scientists also 
need more opportunities to collaborate with 
our international peers to help advance our 
research. Patience and persistence are very 
important, too, of course. 


Interview by Sarah O'Meara. 


This interview has been edited for length 
and clarity. 
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